Query Match 3.2%; Score 87.5; DB 7; Length 1197; 

. Best Local Similarity.. 22.0%; Pred. No. 73; 
RESULT 1413 

ID AD019438 standard; protein; 1197 AA. 

DE Human PRO polypeptide #184. 

PN WO2004043361-A2. 

PD 27-MAY-2004. 

PA (GETH ) GENENTECH INC 

Query Match 3.2%; Score 87.5; DB 8; Length 1197; 

Best Local Similarity 22.0%; Pred. No. 73; 
RESULT 1414 

ID AAM79159 standard; protein; 1452 AA. 

DE Human protein SEQ ID NO 1821. 

PN WO200157190-A2. 

PD 09-AUG-2001. 

PA (HYSE-) HYSEQ INC. 

Query Match 3.2%; Score 87.5; DB 4; Length 1452; " 

Best Local Similarity 25.9%; Pred. No. 99; 
RESULT 1415 

ID ABR58629 standard; protein-; 1452 AA. 

DE Human cancer related protein SEQ ID NO: 286. 

PN WO2003025138-A2 . 

PD 27-MAR-2003. 

PA (EOSB-) EOS BIOTECHNOLOGY INC. 

Query Match 3.2%; Score 87.5; DB 6; Length 1452; 

Best Local Similarity 25.9%; Pred. No. 99; 
RESULT 1416 

ID ADJ68277 standard; protein; 1452 AA. 

DE Human heat mitochondrial protein as a therapeutic target SeqID83. 

PN WO2003087768-A2. 

PD 23-OCT-2003. 

PA (MITO-) MITOKOR. 

PA (BUCK-) BUCK INST AGE RES. 

Query Match 3.2%; Score 87.5; DB 7; Length 1452; 

Best Local Similarity 25.9%; Pred. No,. 99; 
RESULT 1417 

ID ADI80761 standard; protein; 1452 AA. 

DE Human protein tyrosine phosphatase receptor type mu protein sequence. 

PN US2004014699-A1. 

PD 22-JAN-2004. 

PA (ISIS-) ISIS PHARM INC. 

Query Match 3.2%; Score 87.5; DB 8; Length 1452; 

Best Local Similarity 25.9%; Pred. No. 99; 
RESULT 1418 

ID AAM80143 standard; protein; 1455 AA. 

DE Human protein SEQ ID NO 3789. 

PN WO200157190-A2. 

PD 09-AUG-2001. 

PA (HYSE-) HYSEQ INC. 

Query Match 3.2%; Score 87.5; DB 4; Length 1455; 

Best Local Similarity 25.9%; Pred. No. 99; 
RESULT 1419 

ID ABR53450 standard; protein; 1597 AA. 

DE Protein sequence #SEQ ID 1765. 

PN EP1258494-A1. 

PD 20-NOV-2002. 

PA (CELL-) CELLZOME AG. 



Query Match 3.2%; Score 87.5; DB 6; Length 1597; 

Best Local Similarity 22.1%; Pred. No. l.le+02; , 
RESULT 1420 

ID ADK64624 standard; protein; 1597 AA. 

DE Disease treating protein complex-derived protein #1063. 

PN EP1338608-A2. 

PD 27-AUG-2003. 

PA (CELL-) CELLZOME AG. 

Query Match 3.2%; Score 87.5; DB 7; Length 1597; 

Best Local Similarity 22.1%; Pred. No. l.le+02; 
RESULT 1421 

ID AD044172 standard; protein; 6885 AA. 

DE Structural and cytoskeleton-associated polypeptide #8. 

Query Match 3.2%; Score 87.5; DB 8; Length 6885; 

Best Local Similarity 18.6%; Pred. No. l.le+03; 
RESULT 1422 

ID AAM21511 standard; protein; 69 AA. 

DE Peptide #7945 encoded by probe for measuring cervical gene expression. 
PN WO200157278-A2. 
PD 09-AUG-2001. 

PA (MOLE- ) MOLECULAR DYNAMICS INC. 

Query Match 3.1%; Score 87; DB 4; Length 69; 

Best Local Similarity 29.2%; Pred. No. 0.96; 
RESULT 1423 

ID ABB43855 standard; peptide; 69 AA. 

DE Peptide #11361 encoded by human foetal liver single exon probe. 
PN WO200157277-A2. 
PD 09-AUG-2001. 

PA (MOLE-) MOLECULAR DYNAMICS INC. 

Query Match 3.1%; Score 87; DB 4; Length 69; 

Best Local Similarity 29.2%; Pred. No. 0.96; 
RESULT 1424 

ID AAM37766 standard; protein; 69 AA. 

DE Peptide #11803 encoded by probe for measuring placental gene expression. 
PN WO200157272-A2. 
PD 09-AUG-2001. 

PA (MOLE-) MOLECULAR DYNAMICS INC. 

Query Match 3.1%; Score 87; DB 4; Length 69; 

Best Local Similarity 29.2%; Pred. No. 0.96; 
RESULT 1425 

ID ABB26783 standard; protein; 69 AA. 

DE Protein #8782 encoded by probe for measuring heart cell gene expression. 
PN WO200157274-A2. 
PD 09-AUG-2001. 

PA (MOLE-) MOLECULAR DYNAMICS INC. 

Query Match 3.1%; Score 87; DB 4; Length 69; 

Best Local Similarity 29.2%; Pred. No. 0.96; 
RESULT 1426 

ID AAM77581 standard; protein; 69 AA. 

DE Human bone marrow expressed probe encoded protein SEQ ID NO: 37887. 
PN WO200157276-A2. 
PD 09-AUG-2001. 

PA (MOLE-) MOLECULAR DYNAMICS INC. 

Query Match 3.1%; Score 87; DB 4; Length 69; 

Best Local Similarity 29.2%; Pred. No. 0.96; 
RESULT 1427 

ID AAM64832 standard; protein; 69 AA. 



Length 69; 



DE Human brain expressed single exon probe encoded protein SEQ ID NO: 36937. 
PN . WO200157275-A2. 
PD 09-AUG-2001. 

PA (MOLE-) MOLECULAR DYNAMICS INC. 

Query Match 3.1%; Score 87; DB 4; Length 69; 

Best Local Similarity 29.2%; Pred. No. 0.96; 
RESULT 1428 

ID ABG59228 standard; peptide; 69 AA. 
DE Human liver peptide, SEQ ID No 37876. 
PN WO200157273-A2. 
PD 09-AUG-2001. 

PA (MOLE-) MOLECULAR DYNAMICS INC. 

Query Match 3.1%; Score 87; DB 4; 

Best Local Similarity 29.2%; Pred. No. 0.96; 
RESULT 1429 

ID ABG46614 standard; peptide; 69 AA. 

DE Human peptide encoded by genome-derived single exon probe SEQ ID 36279. 
PN WO200186003-A2. 
PD 15-NOV-2001. 

PA (MOLE-) MOLECULAR DYNAMICS INC. 

Query Match 3.1%; Score 87; DB 5; Length 69; 

Best Local Similarity 29.2%; Pred. No. 0.96; 
RESULT 1430 

ID ABP25680 standard; protein; 250 AA. 
DE Streptococcus polypeptide SEQ ID NO 536. 
PN WO200234771-A2. 
PD 02-MAY-2002. 
PA (CHIR-) CHIRON SPA. 
PA (GENO-) INST GENOMIC RES. 

Query Match 3.1%; Score 87; DB 5; 

Best Local Similarity 24.7%; Pred. No. 7.1; 
RESULT 1431 



Length 250; 



ID AAW72573 standard; protein; 356 AA. 

DE Human glycosaminoglycan sulphate group transferase. 
PN JP10257896-A. 
PD 29-SEP-1998. 

PA (SEGK ) SEIKAGAKU KOGYO CO LTD. 

Query Match 3.1%; Score 87; DB 2; Length 356; 

Best Local Similarity 20.2%; Pred. No. 12; 
RESULT 1432 

ID AAB94514 standard; protein; 356 AA. 

DE Human protein sequence SEQ ID NO: 15229. 

PN EP1074617-A2. 

PD 07-FEB-2001. 

PA (HELI-) HELIX RES INST. 

Query Match 3.1%.; Score 87; DB 4; Length 356; 

Best Local Similarity 20.2%; Pred. No. 12; 
RESULT 1433 

ID ABB82859 standard; protein; 356 AA. 
DE HS2ST related polypeptide (GenBank 
PN WO200299138-A2. 
PD 12-DEC-2002. 
PA (EXEL-) EXELIXIS INC. 

Query Match 3.1%; Score 

Best Local Similarity 20.2%; Pred. 
RESULT 1434 ' 

ID ABB82860 standard; protein; 356 AA. 



Identifier No. GI#6683564) . 



87; 
No. 



DB 
12; 



6; Length 356; 



DE HS2ST related polypeptide (GenBank Identifier No. GI#6912420) 

PN WO200299138-A2. 

PD 12-DEC-2002. 

PA (EXEL-) EXELIXIS INC. 

Query Match 3.1%; Score 87; DB 6; Length 356; 

Best Local Similarity 20.2%; Pred. No. 12; 
RESULT 1435 

ID ABU62934 standard; protein; 356 AA. 

Human heparan sulphate 2-O-sulphotrans f erase HS2ST #1. 
US2003013144-A1. 
16-JAN-2003. 
(FRIE/) FRIEDMAN L. 

PLOWMAN G D. 
BELVIN M. 
FRANCIS-LANG H. 
LI D. 



DE 
PN 
PD 
PA 
PA 
PA 
PA 
PA 
PA 



PLOW/) 
(BELV/) 
(FRAN/) 
(LIDD/) 

(FUNK/) FUNKE R P. 
Query Match 3.1%; Score 87; DB 6; Length 356; 

Best Local Similarity 20.2%; Pred. No. 12; 
RESULT 1436 

ID ABU62935 standard; protein; 356 AA. 

Human heparan sulphate 2-O-sulphotransf erase HS2ST #2. 
US2003013144-A1. 



DE 
PN 
PD 
PA 
PA 
PA 
PA 
PA 
PA 



16-JAN-2003. 
(FRIE/) FRIEDMAN L. 

PLOWMAN G D. 
BELVIN M. 
FRANCIS-LANG H. 
LI D. 

FUNKE R P. 



Score 87; DB 6; 
Pred. No. 12; 



(PLOW/) 
(BELV/) 
(FRAN/) 
(LIDD/) 
(FUNK/) 

Query Match 3.1%; 
Best Local Similarity 20.2%; 
RESULT 1437 

ID AAY36994 standard; protein; 431 AA. 

Chlamydia trachomatis lipoprotein sequence. 
W09928475-A2. 
10-JUN-1999. 
(GEST ) GENSET. 
Query Match 3.1%; Score 87; DB 2; 

Best Local Similarity 20.8%; Pred. No. 17; 
RESULT 1438 

ID AAW98588 standard; protein; 488 AA. 
H. pylori GHPO 87 protein. 
W09843478-A1. 
08-OCT-1998. 

(INMR ) MERIEUX ORAVAX PASTEUR MERIEUX.. SERUMS 
(HUMA-) HUMAN GENOME SCI INC. 
Query Match 3.1%; Score 87; DB 2; 

Best Local Similarity 19.5%; Pred. No. 20; 
RESULT 1439 

ID AAR10378 standard; protein; 514 AA. 
Mutant alpha-amylase gene product. 
EP409299-A. 
23-JAN-1991. 

(KONN ) GIST-BROCADES NV. 
Query Match 3.1%; Score 87; DB 

Best Local Similarity 21.8%; Pred. No. 22; 



Length 356; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 
PA 



DE 
PN 
PD 
PA 



Length 431; 



Length 488; 



2; Length 514; 



RESULT 144 0 



ID .AAB12430 standard; protein; 

DE Bacillus amyloliquef aciens 

PN JP2000135093-A. 

PD 16-MAY-2000. 

PA (DAIW ) DAIWA KASEI KK. 

Query Match 3.1%; Score 87; DB 

Best Local Similarity 22.3%; Pred. No. 22; 

RESULT 1441 

ID AAB68551 standard; protein; 551 AA. 

DE Human GTP-binding associated protein #51. 

PN WO200105970-A2. 

PD 25-JAN-2001. 

PA (INCY-) INCYTE GENOMICS INC. 

Query Match 3.1%; Score 87; DB 

Best Local Similarity 19.7%; Pred. No. 25; 

RESULT 1442 

ID AAB94384 standard; protein; 551 AA. 

DE Human protein sequence SEQ ID NO: 14939. 

PN EP1074617-A2. 

PD 07-FEB-2001. 

PA (HELI-) HELIX RES INST. 

Query Match 3.1%; Score 87; DB 

Best Local Similarity 19.7%; Pred. No. 25; 

RESULT 144 3 



514 AA. 

clone number 22 protein SEQ ID NO 



3; Length 514; 



4; Length 551; 



4; Length 551; 



ID ADP54518 standard; protein; 

DE Human PRO protein sequence 

PN WO2004039956-A2. 

PD 13-MAY-2004. 

PA (GETH ) GENENTECH INC. 

Query Match 3.1%; 

Best Local Similarity 19.7%; 
RESULT 1444 



551 AA. 
SEQ ID NO: 494. 



Score 
Pred. 

565 AA. 



87; 
No. 



DB 8; 
25; 



Length 551; 



ID AAB14796 standard; protein; 
DE TGF-beta type II receptor. 
PN US6093547-A. 
PD 25-JUL-2000. 

PA (CREA-) CREATIVE BIOMOLECULES INC. 
Query Match 3.1%; Score 

Best Local Similarity 23.2%; Pred. 

RESULT 1445 



87; DB 
No. 25; 



3; Length 565; 



ID ADN61899 standard; protein; 

DE Human novel protein NOV56a. 

PN US2004043382-A1. 

PD 04-MAR-2004 . 

PA (PADI/) PAD'IGARU M. 

PA (SPYT/) SPYTEK K A. 

PA (SHEN/) SHENOY S G. 

PA (TAUP/) TAUPIER R J. 

PA (PENA/) PEN A C E A. 

PA (LILL/) LI L. 

PA (ZERH/) ZERHUSEN B D. 

PA (GUSE/) GUSEV V Y. 

PA (JIWW/) JI W. 

PA (GORM/) GORMAN L. 

PA (MILL/) MILLER C E. 

PA (KEKU/) KEKUDA R. 



577 AA. 



PA 


( PATT/ ) 


PATTURAJAN M. 


PA 


(GANG/) 


GANGOLLI E A. 


PA 


(VERN/) 


VERNET CAM. 


PA 


(GUOX/) 


GUO X S. 


PA 


(TCHE/) 


TCHERNEV V T. 


PA 


(FERN/) 


FERN ANDES E R. 


PA 


(CASM/) 


CASMAN S J. 


PA 


(MALY/) 


MALYANKAR U M. 


PA 


(GERL/) 


GERLACH V. 


PA 


(LIUY/) 


LIU Y. 


PA 


(ANDE/) 


ANDERSON D W. 


PA 


(SPAD/) 


SPADERNA S K. 


PA 


(CATT/) 


CATTERTON E. 


PA 


(LEIT/) 


LEITE M W. 


PA 


(ZHON/) 


ZHONG H. 


PA 


(ALSO/) 


ALSOBROOK J P. 


PA 


(LEPL/) 


LEPLEY D M. 


PA 


(RIEG/) 


RIEGER D K. 


PA 


(BURG/) 


BURGESS C E. 


Query Match 



Best 
RESULT 
ID 



Local 
1446 



Similarity 



3.1%; 
19.6%; 



Score 
Pred. 



87; 
No. 



DB 8; 
26; 



Length 577; 



Length 930; 



AAW83995 standard; protein; 893 AA. 
DE Tne DNA polymerase mutant D137A, D323A, R722N, F730Y. 
PN WO9835060-A1. 
PD 13-AUG-1998. 

PA (LIFE-) LIFE TECHNOLOGIES INC. 

Query Match 3.1%; Score 87; DB 2; Length 893; 

Best Local Similarity 19.7%; Pred. No. 52; 
RESULT 1447 

ID ADC37548 standard; protein; 930 AA. 

DE Human nucleic acid associated protein, NAAP-15. 

PN WO2003046151-A2. 

PD 05-JUN-2003. 

PA (INCY-) INCYTE GENOMICS INC. 

Query Match 3.1%; Score 87; DB 7; 

Best Local Similarity 19.9%; Pred. No. 55; 
RESULT 1448 

ID ADC95237 standard; protein; 944 AA. 
DE E. faecium protein sequence SEQ ID 4864. 
PN US6583275-B1. 
PD 24-JUN-2003. 

PA (GENO-) GENOME THERAPEUTICS CORP. 

Query Match 3.1%; Score 87; DB 7; 

Best Local Similarity 19.1%; Pred. No. 57; 
RESULT 1449 

ID ABP27459 standard; protein; 1034 AA. 
DE Streptococcus polypeptide SEQ ID NO 4094. 
PN WO200234771-A2. 
PD 02-MAY-2002. 
PA (CHIR-) CHIRON SPA. 
PA (GENO-) INST GENOMIC RES. 

Query Match 3.1%; Score 87; DB 

Best Local Similarity 18.7%; Pred. No. 65; 
RESULT 1450 

ID AAU35929 standard; protein; 1167 AA. 
DE 



Length 944; 



5; Length 1034; 



Helicobacter pylori cellular proliferation protein #242. 



DB 4; 
79; 



PN WO200170955-A2. 
PD 27-SEP-2001. 
PA (ELIT-) ELITRA PHARM INC. 

Query Match 3.1%; 

Best Local Similarity 18.8%; 
RESULT 1451 

ID ABR53378 standard; protein; 
DE Protein sequence #SEQ ID 1621 
PN EP1258494-A1. 
PD 20-NOV-2002. 
PA (CELL-) CELLZOME AG. 

Query Match 3.1%; 

Best Local Similarity 24.5%; 
RESULT 1452 

ID ADK63342 standard; protein; 
DE Disease treating protein complex-derived protein #982 
PN EP1338608-A2. 
PD 27-AUG-2003. 
PA. (CELL-) CELLZOME AG. 

Query Match 3.1%; 

Best Local Similarity 24.5%; 
RESULT 1453 

ID ABR47539 standard; protein; 



Score 87; 
Pred. No. 

1411 AA. 



Score 87; 
Pred. No. 

1411 AA. 



Score 87; 
Pred. No. 

1855 AA. 



Length 1167; 



DB 6; 
1. le+02; 



Length 1411; 



DB 7; Length 1411; 
1. le+02; 



DE 
PN 
PD 
PA 



Breast cancer associated protein sequence SEQ ID NO: 315. 
WO2003004989-A2. 



DE' 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Score 87; 
Pred. No. 

1855 AA. 



Score 87; 
Pred. No. 



DB 6; Length 1855; 
1.6e+02; 



DB 8; Length 1855; 
1.6e+02; 



16-JAN-2003. 

(MILL-) MILLENIUM PHARM INC. 
Query Match 3.1%; 
Best Local Similarity 22.1%; 
RESULT 1454 

ID ADN04056 standard; protein; 

Antipsoriatic protein sequence #223, 
WO2004028479-A2. 
08-APR-2004 . 
(GETH ) GENENTECH INC. 
Query Match 3.1%; 
Best Local Similarity 22.1%; 
RESULT 1455 

ID ABB67961 standard; protein; 293 AA. 

Drosophila melanogaster polypeptide SEQ ID NO 30675. 
WO200171042-A2. 
27-SEP-2001. 
(PEKE ) PE CORP NY. 
Query Match 3.1%; Score 86.5; DB 4; Length 293; 

Best Local Similarity 21.1%; Pred. No. 10; 
RESULT 1456 

ID AAU38041 standard; protein; 352 AA. 

Streptococcus pneumoniae cellular proliferation protein #470. 
WO200170955-A2. 



DE 
PN 
PD 
PA 



27-SEP-2001. 

(ELIT-) ELITRA PHARM INC. 
Query Match 3.1%; Score 86.5; DB 4; Length 352; 

Best Local Similarity 29.2%; 
RESULT 1457 

ID ABU23929 standard; protein; 

DE Protein encoded by Prokaryotic essential gene #9456 
PN WO200277183-A2. 



Score 86.5; DB 4; 
Pred. No. 14; 

359 AA. 



PD 03-OCT-2002. 

PA (ELIT-) ELITRA PHARM INC. 

Query Match 3.1%; Score 86.5; DB 6; Length 359; 

Best Local Similarity 18.7%; Pred. No. 14; 
RESULT 1458 

ID AAG42135 standard; protein; 369 AA. 

DE Arabidopsis thaliana protein fragment SEQ ID NO: 52510. 
PN EP1033405-A2. 
PD 06-SEP-2000. 

Query Match 3.1%; Score 86.5; DB 3; Length 369; 

Best Local Similarity 21.8%; Pred. No. 15; 
RESULT 1459 

ID AAB61228 standard; protein; 498 AA. 
DE Human TANGO 325 extracellular domain. 
PN WO200100638-A2. 
PD 04-JAN-2001. 

PA (MILL-) MILLENNIUM PHARM INC. 

Query Match 3.1%; Score 86.5; DB 4; Length 498; 

Best Local Similarity 21.3%; Pred. No. 24; 
RESULT 1460 

ID ADB90766 standard; protein; 498 AA. 
DE Human TANGO 325 extracellular domain. 
PN US2003082586-A1. 
PD 01-MAY-2003. 

PA (MILL-) MILLENNIUM PHARM INC. 

Query Match 3.1%; Score 86.5; DB 7; Length 498; 

Best Local Similarity 21.3%; Pred. No. 24; 
RESULT 1461 

ID ADF71501 standard; protein; 498 AA. 
DE Human TANGO 325 extracellular domain. 
PN US2003175733-A1. 
PD 18-SEP-2003. 

PA (MILL-) MILLENNIUM PHARM INC. 

Query Match 3.1%; Score 86.5; DB 7; Length 498; 

Best Local Similarity 21.3%; Pred. No. 24; 
RESULT 1462 

ID ADM42026 standard; protein; 498 AA. 

DE Human TANGO 325 extracellular domain. 

PN US2003170621-A1. 

PD ll-SEP-2003. 

PA (MCCA/) MCCARTHY S A. 

PA (FRAS/) FRASER C C. 

PA (SHAR/) SHARP J D. 

PA (BARN/) BARNES T M. 

PA (KIRS/) KIRST S J. 

PA (MYER/) MYERS PS. ... 

PA (WRIG/) WRIGHTON N. 

PA (GOOD/) GOODEARL ADJ. 

PA (HOLT/) HOLTZMAN D A. 

PA (KHOD/) KHODADOUST M. 

Query Match 3.1%; Score 86.5; DB 7; Length 498; 

Best Local Similarity 21.3%; Pred. No. 24; 
RESULT 14 63 

ID ADI36908 standard; protein; 498 AA. 
DE Human LRR protein #3. 
PN US2003220263-A1. 
PD 27-NOV-2003. 



PA (FEDE/) FEDER J N. 
PA (MINT/) MINTIER G. 
PA (RAMA/) RAMAN AT HAN C S. 

Query Match 3.1%; Score 86.5; DB 8; Length 498; 

Best Local Similarity 21.3%; Pred. No. 24; 
RESULT 1464 * 

ID AAY35078 standard; protein; 542 AA. 

DE Chlamydia ^pneumoniae protein not found in C. trachomatis. 
PN WO9927105-A2. 
PD 03-JUN-1999. 
PA (GEST ) GENSET. 

Query Match 3.1%; Score 86.5; DB 2; Length 542; 

Best Local Similarity 19.9%; Pred. No. 27; 
RESULT 1465 

ID AAB61227 standard; protein; 591 AA. 
DE Mature human TANGO 325 protein. 
PN WO200100638-A2. 
PD 04-JAN-2001. 

PA (MILL-) MILLENNIUM PHARM INC. 

Query Match 3.1%; Score 86.5; DB 4; Length 591; 

Best Local Similarity 21.3%; Pred. No. 31; 
RESULT 1466 

ID ADB90765 standard; protein; 591 AA. 
DE Human TANGO 325 mature protein. 
PN US2003082586-A1. 
PD 01-MAY-2003. 

PA (MILL-) MILLENNIUM PHARM INC. 

Query Match 3.1%; Score 86.5; DB 7; Length 591; 

Best Local Similarity 21.3%; Pred. No. 31; 
RESULT 1467 

ID ADF71500 standard; protein; 591 AA. 
DE Human TANGO 325 mature protein. 
PN US2003175733-A1. 
PD 18-SEP-2003. 

PA (MILL-) MILLENNIUM PHARM INC. 

Query Match 3.1%; Score 86.5; DB 7; Length 591; 

Best Local Similarity 21.3%; Pred. No. 31; 
RESULT 1468 

591 AA. 



ID 


ADM42025 standard; 


protein; 


DE 


Mature 


human TANGO 


325. 


PN 


US2003170621-A1. 




PD 


11-SEP- 


2003. 




PA 


(MCCA/) 


MCCARTHY S 


A. 


PA 


(FRAS/) 


FRASER C C 




PA 


(SHAR/) 


SHARP J D. 




PA 


(BARN/) 


BARNES T M 




PA 


(KIRS/) 


KIRST S J. 




PA 


(MYER/) 


MYERS P S. 




PA 


(WRIG/) 


WRIGHTON N 




PA 


(GOOD/) 


GOODEARL A 


D J. 


PA 


(HOLT/) 


HOLTZMAN D 


A. 


PA 


(KHOD/) 


KHODADOUST 


M. 



Query Match 3.1%; Score 8 6.5; DB 7; Length 591; 

Best Local Similarity 21.3%; Pred. No. 31; 
RESULT 1469 

ID ADI36907 standard; protein; 591 AA. 
DE Human LRR protein #2. 



PN US2003220263-A1. 

PD . 27-NOV-2003. 

PA (FEDE/) FEDER J N. 

PA (MINT/) MINTIER G. 

PA (RAMA/) RAMAN AT HAN C S. 

Query Match 3.1%; Score 86.5; DB 8; Length 591; 

Best Local Similarity 21.3%; Pred. No. 31; 
RESULT 1470 

ID AAE06798 standard; protein; 597 AA. 

DE Mature human neuronal guidance molecule (NGM) -like protein #1. 

PN WO200157262-A1. 

PD 09-AUG-2001. 

PA (HYSE-) HYSEQ INC. 

Query Match 3.1%; Score 86.5; DB 4; Length 597; 

Best Local Similarity 21.3%; Pred. No. 31; 
RESULT 1471 

ID ADI36910 standard; protein; 597 AA. 

DE Human LRR protein #5. 

PN US2003220263-A1. 

PD 27-NOV-2003. 

PA (FEDE/) FEDER J N. 

PA (MINT/) MINTIER G. 

PA (RAMA/) RAMANATHAN C S. 

Query Match 3.1%; Score 86.5; DB 8; Length 597; 

Best Local Similarity 21.3%; Pred. No. 31; 
RESULT 1472 

ID ADS23090 standard; protein; 613 AA. 

DE Bacterial polypeptide #12123. 

PN US2003233675-A1. . 

PD 18-DEC-2003. 

PA (CAOY/) CAO Y. 

PA (HINK/) HINKLE G J. 

PA (SLAT/) SLATER S C. 

PA (CHEN/) CHEN X. 

PA (GOLD/) GOLDMAN B S. 

Query Match 3.1%; Score 86.5; DB 8; Length 613; 

Best Local Similarity 18.8%; Pred. No. 33; 
RESULT 1473 

ID AAB61225 standard; protein; 622 AA. 
DE Human TANGO 325 protein. 
PN WO200100638-A2 . 
PD 04-JAN-2001. 

PA (MILL-) MILLENNIUM PHARM INC. 

Query Match 3.1%; Score 86.5; DB 4; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1474 

ID AAU12261 standard; protein; 622 AA. 

DE Human PR04337 polypeptide sequence. 

PN WO200140466-A2. 

PD 07-JUN-2001. 

PA (GETH ) GENENTECH INC. 

Query Match 3.1%; Score 86.5; DB 4; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1475 

ID AAU20531 standard; protein; 622 AA. 
DE Human secreted protein, Seq ID No 523. 
PN WO200155326-A2. 



PD 02-AUG-2001. 

PA (HUMA-) HUMAN GENOME SCI INC. 

Query Match 3.1%; Score 86.5; DB 4; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 147 6 

ID AAB45703 standard; protein; 622 AA. 

DE Human 7TM clone HDTIE58 protein fragment #1. 

PN WO200071584-A1. 

PD 30-NOV-2000. 

PA (HUMA-) HUMAN GENOME SCI INC. 

Query Match 3.1%; Score 86.5; DB 4; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1477 

ID AAU19922 standard; protein; 622 AA. 

DE Novel human calcium-binding protein #31. 

PN WO200155304-A2. 

PD 02-AUG-2001. 

PA (HUMA-) HUMAN GENOME SCI INC.- 

Query Match 3.1%; Score 86.5; DB 4; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1478 

ID AAE06789 standard; protein; 622 AA. 

DE Human neuronal guidance molecule (NGM) -like protein #1. 

PN WO200157262-A1. 

PD 09-AUG-2001. 

PA (HYSE-) HYSEQ INC. 

Query Match 3.1%; Score 86.5; DB 4; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1479 

ID ABO17705 standard; protein; 622 AA. 

DE Novel human secreted and transmembrane protein PR04337. 

PN US2003032156-A1. 

PD 13 1 FEB-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 3.1%; Score 86.5; DB 6; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1480 

ID ABU80959 standard; protein; 622 AA. 

DE Human PRO polypeptide #90. 

PN US2003004311-A1. 

PD 02-JAN-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 3.1%; Score 86.5; DB 6; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1481 

ID ABU66659 standard; protein; 622 AA. 

DE Human PRO polypeptide #90. 

PN US2003036180-A1. 

PD 20-FEB-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 3.1%; Score 86.5; DB 6; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1482 

ID ABU59740 standard; protein; 622 AA. 

DE Novel secreted and transmembrane protein PR04337. 

PN US2003017563-A1. 

PD 23-JAN-2003. 



Score 86.5; DB 6; 
Pred. No. 33; 



Length 622; 



Score 86.5; DB 6; 
Pred. No. 33; 



Length 622; 



Score 86.5; DB 6; 
Pred. No. 33; 



Length 622; 



Score 86. 5; DB 6; 
Pred. No. 33; 



Length 622; 



PA (GETH ) GENENTECH INC. 

Query Match 3.1%; 

Best Local Similarity 21.3%; 
RESULT 1483 

ID ABO24930 standard; protein; 622 AA. 
DE Human secreted/transmembrane protein (PRO) #90 
PN US2003036179-A1. 
PD 20-EEB-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 3.1%; 

Best Local Similarity 21.3%; 
RESULT 1484 

ID ABU66935 standard; protein; 622 AA. 

DE Human secreted/transmembrane, PRO, protein SEQ ID 180. 
PN US2003032155-A1. 
PD 13-FEB-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 3.1%; 

Best Local Similarity 21.3%; 
RESULT 1485 

ID ADA45699 standard; protein; 622 AA. 

Novel human secreted and transmembrane protein PR04337. 
US2003022328-A1. / 
30-JAN-2003. 
(GETH ) GENENTECH INC. 
Query Match 3.1%; 
Best Local Similarity 21.3%; 
RESULT 1486 

ID ADA76130 standard; protein; 622 AA 
Human PRO .polypeptide #90. 
US2003073212-A1. 
17-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 3.1%; 
Best Local Similarity 21.3%; 
RESULT 1487 

ID ADA18780 standard; protein; 622 AA. 
Human PRO polypeptide #90. 
US2003054517-A1. 
20-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 3.1%; Score 86.5; DB 6; Length 622; 

Best Local Similarity 21.3%; 
RESULT 1488 

ID ADA61403 standard; protein; 
Homo sapiens. 
US2003049816-A1. 
13-MAR-2003. 
(GETH ) GENENTECH INC. 
Query Match 3.1%; Score 86.5; DB 6; Length 622; 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1489 

ID ADB19188 standard; protein; 622 AA. 

DE Novel human secreted and transmembrane protein PR04337. 

PN US2003068796-A1. 

PD 10-APR-2003. 

PA (GETH ) GENENTECH INC. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Score 86.5; DB 6; 
Pred. No. 33; 



Length 622; 



Score 86.5; DB 6; 
Pred. No. 33; 

622 AA. 



Score 86.5; DB 6; Length 622; 



Pred. No. 33.; 



622 AA. 



Score 86.5; DB 6; 
Pred. No. 33; 

622 AA. 



PR04337, 



Score 86.5; DB 6; Length 622; 
Pred. No. 33; 



622 AA. 



Score 86.5; DB 6; 
Pred. No. 33; 

622 AA. 



Query Match 3.1%; 
Best Local Similarity 21.3%; 
RESULT 1490 

ID ADB27729 standard; protein; 
DE Human PRO polypeptide #90. 
PN US2003082704-A1. 
PD 01-MAY-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 3.1%; 

Best Local Similarity 21.3%; 
RESULT 1491 

ID ADA86208 standard; protein; 
DE Novel human secreted and transmembrane protein 
PN US2003082711-A1. 
PD 01-MAY-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 3.1%; 

Best Local Similarity 21.3%; 
RESULT 14 92 

ID ADB15772 standard; protein; 
Human PRO polypeptide #90. 
US2003087350-A1. 
08-MAY-2003. 
(GETH ) GENENTECH INC. 
Query Match 3.1%; 
Best Local Similarity 21.3%; 
RESULT 1493 

ID ADA47558 standard; protein; 
Human PRO polypeptide #90. 
US2003073215-A1. 
17-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 3.1%; 
Best Local Similarity 21.3%; 
RESULT 1494 

ID ADA67353 standard; protein; 
Human PRO polypeptide #90. 
US2003068795-A1. 
10-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 3.1%; 
Best Local Similarity 21.3%; 
RESULT 1495 

ID ADB30360 standard; protein; 
Human PRO polypeptide #90. 
US2003068794-A1. 
10-APR-2003. 
(GETH ) GENENTECH INC. 
Query Match 3.1%; 
Best Local Similarity 21.3%; 
RESULT 1496 

ID ADA85656 standard; protein; 
DE Novel human secreted and transmembrane protein 
PN US2003082693-A1. 
PD 01-MAY-2003. 
PA (GETH ) GENENTECH INC. 



Length 622; 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Length 622; 



Score 86.5; DB 6; Length 622; 
Pred. No. 33; 



622 AA. 



DE 
PN 
PD 
PA 



DE 
PN 
PD 
PA 



Score 86.5; DB 6; 
Pred. No. 33; 

622 AA. 



Score 86.5; DB 6; 
Pred. No. 33; 

622 AA. 



Length 622; 



Length 622; 



PR04337. 



Query Match 



3.1%; Score 86.5; DB 6; Length 622; 



Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1497 

ID ADA96868 standard; protein; 622 AA. 

DE Human PRO polypeptide #90. 

PN US2003082705-A1. 

PD 01-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 3.1%; Score 86.5; DB 

Best Local Similarity 21.3%; Pred. No. 33; 

RESULT 1498 

ID ADA79172 standard; protein; 622 AA. 

DE Human PRO polypeptide #90. 

PN US2003082763-A1. 

PD 01-MAY-2003. 

PA (GETH ) GENENTECH INC. 

Query Match 3.1%; Score 86.5; DB 

Best Local Similarity 21.3%;, Pred. No. 33; 

RESULT 1499 

ID 



6; Length 622; 



6; Length 622; 



6; Length 622; 



ADA87311 standard; protein; 622 AA. 
DE Novel human secreted and transmembrane protein PR04337. 
PN US2003087345-A1. 
PD 08-MAY-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 3.1%; Score 86.5; DB 

Best Local Similarity 21.3%; Pred. No. 33; 
RESULT 1500 

ID ADB16513 standard; protein; 622 AA. 
DE Human PRO polypeptide #90. 
PN US2003087349-A1. 
PD 08-MAY-2003. 
PA (GETH ) GENENTECH INC. 

Query Match 3.1%; Score 

Best Local Similarity 21.3%; Pred. 



86. 
No. 



5; DB 
33; 



6; Length 622; 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: February 15, 2005, 09:44:27 



; Search time 43 Seconds 
(without alignments) 
907.941 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-017-867A-282 
2768 

1 MAGQRVLLLVGFLLPGVLLS GKLLGMAVWWLRGARKVKET 523 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 513545 seqs, 74649064 residues 

Total number of hits satisfying chosen parameters: 



513545 



Minimum DB seq length: 
Maximum DB seq length: 



0 

2000000000 



Post-processing: 



Database : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 1500 summaries 

Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/ l/iaa/5A_COMB . pep : * 

2: /cgn2_6/ptodata/l/iaa/5B_COMB.pep:* 

3: /cgn2_6/ptodata/l/iaa/6A_COMB.pep:* 

4 : /cgn2_6/ptodata/l/iaa/6B_COMB.pep: * 

5 : /cgn2_6/ptodata/ l/iaa/PCTUS_COMB . pep : * 

6: /cgn2_6/ptodata/l/iaa/backf ilesl.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


67 9 


24.5 


533 


4 


US-09-949-016-5947 


Sequence 5947, Ap 


2 


679 


24.5 


538 


4 


US-09-949-016-7684 


Sequence 7684, Ap 


3 


674 


24.3 


533 


5 


PCT-US92-00282-3 


Sequence 3, Appli 


4 


663.5 


24.0 


540 


4 


US-09-949-016-8465 


Sequence 8465, Ap 


5 


663.5 


24.0 


540 


4 


US-09-949-016-9052 


Sequence 9052, Ap 


6 


661.5 


23.9 


528 


4 


US-09-356-806-8 


Sequence 8, Appli 


7 


658 


23.8 


531 


5 


PCT-US92-00282-6 


Sequence 6, Appli 


8 


657 


23.7 


530 


3 


US-09-180-852-2 


Sequence 2, Appli 


9 


651.5 


23.5 


524 


4 


US-09-356-806-40 


Sequence 40, Appl 


10 


649 


23.4 


439 


4 


US-09-949-016-8605 


Sequence 8605, Ap 


11 


649 


23.4 


530 


4 


US-09-356-806-113 


Sequence 113, App 



12 


632.5 


22. 


9 


528 


4- 


US-09-949-016-6999 


Sequence 


6999, Ap 


13 


629 


22. 


7 


531 


5 


PCT-US92-00282-5 


Sequence 


5, Appli 


14 


606 


21. 


9 


534 


5 


PCT-US92-00282-4 


Sequence 


4, Appli 


15 


600 


21. 


7 


529 


5 


PCT-US92-00282-7 


Sequence 


7, Appli 


16 


592.5 


21. 


4 


388 


4 


US-09-949-016-8466 


Sequence 


8466, Ap 


17 


592.5 


21. 


4 


388 


4 . 


US-09-949-016-8467 


Sequence 


8467, Ap 


18 


548 


19. 


8 


440 


4 


US-09-949-016-8606 


Sequence 


8606, Ap 


19 


548 


19. 


8 


440 


4 


US-09-949-016-8607 


Sequence 


8607, Ap 


20 


521 


18. 


8 


454 


3 


US-09-813-918-2 


Sequence 


2, Appli 


21 


521 


18. 


8 


454 


4 


US-10-060-311-2 


Sequence 


2, Appli 


22 


503.5 


18. 


2 


288 


3 


US-09-813-918-3 


Sequence 


3, Appli 


23 


503.5 


18. 


2 


288 


4 


US-10-060-311-3 


Sequence 


3, Appli 


24 


380 


13. 


7 


245 


4 


US-09-305-856B-18 


Sequence 


18, Appl 


25 


378.5 


13. 


7 


389 


4 


US-09-270-767-45357 


Sequence 


45357, A 


26 


360.5 


13. 


0 


256 


4 


US-09-270-767-33692 


Sequence 


33692, A 


27 


360.5 


13. 


0 


256 


4 


US-09-270-767-48909 


Sequence 


48909, A 


28 


346.5 


12. 


5 


515 


3 


US-08-942-012B-32 


Sequence 


32, Appl 


29 


342.5 


12. 


4 


460 


3 


US-08-942-012B-33 


Sequence 


33, Appl 


30 


321 


11. 


6 


515 


3 


US-08-942-012B-24 


Sequence 


24, Appl 


31 


316.5 


11. 


4 


488 


3 


US-08-942-012B-29 


Sequence 


29, Appl 


32 


316.5 


11. 


4 


488 


3 


US-08-942-012B-30 


Sequence 


30, Appl 


33 


300 


10. 


8 


288 


4 


US-09-305-856B-2 


Sequence 


2, Appli 


34 


300 


10. 


8 


288 


5 


PCT-US92-002 82-19 


Sequence 


19, Appl 


35 


280 


10. 


1 


339 


4 


US-09-270-767-42493 


Sequence 


42493, A 


36 


272 


9. 


8 


493 


3 


US-08-942-012B-28 


Sequence 


28, Appl 


37 


259. 5 


9. 


4 


489 


3 


US-08-942-012B-31 


Sequence 


31, Appl 


38 


258.5 


9. 


3 


287 


4 


US-09-305-856B-10 


Sequence 


10, Appl 


39 


256 


9. 


2 


310 


4 


US-09-305-856B-14 


Sequence 


14, Appl 


40 


255 


9. 


2 


289 


4 


US-09-305-856B-8 


Sequence 


8, Appli 


41 


252 


9. 


1 


286 


5 


PCT-US92-00282-9 


Sequence 


9, Appli 


42 


249 


9. 


0 


289 


5 


PCT-US92-00282-11 


Sequence 


11, Appl 


43 


248 


9. 


0 


506 


3 


US-08-942-012B-26 


Sequence 


26, Appl 


44 


246.5 


8. 


9 


506 


3 


US-08-942-012B-25 


Sequence 


25, Appl 


45 


246.5 


8. 


9 


506 


6 


5180581-2 


Patent No. 


. 5180581 


46 


246. 5 


8. 


9 


506 


6 


5180581-2 


Patent No. 


, 5180581 


47 


244.5 


8. 


8 


289 


4 


US-09-305-856B-4 


Sequence 


4, Appli 


48 


244.5 


8. 


8 


289 


5 


PCT-US92-00282-15 


Sequence 


15, Appl 


49 


242 


8. 


7 


491 


3 


US-08-942-012B-27 


Sequence 


27, Appl 


50 


239 


8. 


6 


129 


4 


US-09-370-838-36 


Sequence 


36, Appl 


51 


239 


8. 


6 


129 


4 


US-09-854-133-36 


Sequence 


36, Appl 


52 


227 


8. 


2 


317 


4 


US-09-305-856B-12 


Sequence 


12, Appl 


53 


226 


8. 


2 


289 


4 


US-09-305-856B-6 


Sequence 


6, Appli 


54 


226 


8. 


2 


289 


5 


PCT-US92-00282-13 


Sequence 


13, Appl 


55 


223 


8. 


1 


253 


4 


US-09-305-856B-16 


Sequence 


16, Appl 


56 


214 


7. 


7 


52 


2 


US-08-466-583-7. 


Sequence 


7, Appli 


57 


214 


7. 


7 


52 


4 


US-08-265-427-7 


Sequence 


7, Appli 


58 


214 


7. 


7 


52 


5 


PCT-US95-07820-7 


Sequence 


7, Appli 


59 


209.5 


7. 


6 


134 


4 


US-09-270-767-33090 


Sequence 


33090, A 


60 


209.5 


7. 


6 


134 


4 


US-09-270-767-48307 


Sequence 


48307, A 


61 


202 


7. 


3 


58 


2 


US-08-466-583-9 


Sequence 


9, Appli 


62 


202 


7. 


3 


58 


4 


US-08-265-427-9 


Sequence 


9, Appli 


63 


202 


7. 


3 


58 


5 


PCT-US95-07820-9 


Sequence 


9, Appli 


64 


198.5 


7. 


2 


322 


4 


US-09-270-767-42787 


Sequence 


42787, A 


65 


196 


7. 


1 


287 


4 


US-09-270-767-33212 


Sequence 


33212, A 


66 


192 


6. 


9 


135 


4 


US-09-270-767-45571 


Sequence 


45571, A 


67 


186.5 


6. 


7 


141 


4 


US-09-270-767-32085 


Sequence 


32085, A 


68 


176 


6. 


4 


463 


4 


US-08-311-731A-7 


Sequence 


7, Appli 



69 


175 


6.3 


56 


2 


US-08-466-583-4 


Sequence 


4, Appli 


70 


175 


6.3 . 


56 


4 


US-08-265-427-4 


Sequence 


4, Appli 


71 


175 


6.3 


56 


5 


PCT-US95-07820-4 


Sequence 


4, Appli 


72 


174 


6.3 


425 


4 


US-09-902-540-11331 


Sequence 


11331, A 


73 


173.5 


6.3 


303 


4 


US-09-270-767-41486 


Sequence 


41486, A 


74 


173.5 


6.3 


399 


1 


US-08-096-623A-20 


Sequence 


20, Appl 


75 


173.5 


6.3 


431 


1 


US-07-783-705A-2 


Sequence 


2, Appli 


76 


172.5 


6.2 


154 


4 


US-09-270-767-32580 


Sequence 


32580, A 


77 


172.5 


6.2 


154 


4 


US-09-270-767-47797 


Sequence 


47797, A 


78 


169.5 


6.1 


397 


4 


US-09-902-540-14631 


Sequence 


14631, A 


79 


168 


6.1 


74 


5 


PCT-US92-00282-24 


Sequence 


24, Appl 


80 


168 


6.1 


130 


4 


US-09-270-767-57790 


Sequence 


57790, A 


81 


167.5 


6.1 


397 


4 


US-09-724-797-26 


Sequence 


26, Appl 


82 


164 


5.9 


471 


2 


US-08-466-583-2 


Sequence 


2, Appli 


83 


164 


5.9 


471 


4 


US-08-265-427-2 


Sequence 


2, Appli 


84 


164 


5.9 


471 


5 


PCT-US95-07820-2 


Sequence 


2, Appli 


85 


162 


5.9 


459 


4 


US-09-679-263-4 


Sequence 


4, Appli 


86 


162 


5.9 


503 


4 


US-09-679-263-5 


Sequence 


5, Appli 


87 


161 


5.8 


454 


4 


US-09-679-263-12 


Sequence 


12, Appl 


88 


161 


5.8 


1198 


4 


US-09-284-768A-10 


Sequence 


10, Appl 


89 


160 


5.8 


473 


4 


US-09-284-768A-24 


Sequence 


24, Appl 


90 


159 


5.7 


63 


2 


US-08-466-583-8 


Sequence 


8, Appli 


91 


159 


5.7 


63 


4 


US-08-265-427-8 


Sequence 


8, Appli 


92 


159 


5.7 


63 


5 


PCT-US95-07820-8 


Sequence 


8, Appli 


93 


159 


5.7 


131 


2 


US-08-466-583-5 


Sequence 


5, Appli 


94 


159 


5.7 


131 


4 


US-08-265-427-5 


Sequence 


5, Appli 


95 


159 


5.7 


131 


5 


PCT-US95-07 820-5 


Sequence 


5, Appli 


96 


158 


5.7 


190 


4 


US-09-615-192A-301 


Sequence 


301, App 


97 


157.5 


5.7 


131 


2 


US-08-466-583-6 


Sequence 


6, Appli 


98 


157. 5 


5.7 


131 


4 


US-08-265-427-6 


Sequence 


6, Appli 


99 


157.5 


5.7 


131 


5 


PCT-US95-07820-6 


Sequence 


6, Appli 


100 


154 


5.6 


452 


4 


US-09-889-738-21 


Sequence 


21, Appl 


101 


153.5 


5.5 


452 


4 


US-08-311-731A-6 


Sequence 


6, Appli 


102 


149.5 


5.4 


414 


1 


US-09-337-913-1 


Sequence 


1, Appli 


103 


149.5 


5.4 


414 


2 


US-08-750-524-1 


Sequence 


1, Appli 


104 


147 


5.3 


185 


4 


US-09-270-767-56708 


Sequence 


56708, A 


105 


145. 5 


5.3 


471 


3 


US-09-106-464-2 


Sequence 


2 , Appli 


106 


142 


5.1 


78 


4 


US-09-513-999C-7361 


Sequence 


7361, Ap 


107 


139 


5.0 


476 


4 


US-09-673-300-10 


Sequence 


10, Appl 


108 


137 


4.9 


128 


4 


US-09-270-767-32549 


Sequence 


32549, A 


109 


136.5 


4.9 


488 


2 


US-08-797-226-2 


Sequence 


2, Appli 


110 


134 


4.8 


283 


4 


US-09-270-767-44687 


Sequence 


44687, A 


111 


134 


4.8 


488 


4 


US-09-673-300-8 


Sequence 


8, Appli 


112 


132.5 


4.8 


337 


4 


US-09-270-767-46773 


Sequence 


46773, A 


113 


131.5 


4.8 


419 


4 


US-09-724-797-95 


Sequence 


95, Appl 


114 


126. 5 


4.6 


481 


4 


US-09-673-300-2 


Sequence 


2, Appli 


115 


120.5 


4.4 


422 


4 


US-09-679-279-17 


Sequence 


17, Appl 


116 


116.5 


4.2 


1114 


2 


US-08-576-626A-31 


Sequence 


31, Appl 


117 


114 


4.1 


2544 


2 


US-08-576-626A-32 


Sequence 


32, Appl 


118 


113 


4.1 


415 


2 


US-08-576-626A-52 


Sequence 


52, Appl 


119 


113 


4.1 


419 


4 


US-09-107-532A-4894 


Sequence 


4894, Ap 


120 


112.5 


4.1 


448 


4 


US-09-284-768A-23 


Sequence 


23, Appl 


121 


112.5 


4.1 


637 


4 


US-09-284-768A-20 


Sequence 


20, Appl 


122 


112.5 


4.1 


674 


4 


US-09-284-768A-21 


Sequence 


21, Appl 


123 


111 


4.0 


443 


1 


US-08-660-765A-4 


Sequence 


4, Appli 


124 


110.5 


4.0 


378 


4 


US-09-724-797-40 


Sequence 


40, Appl 


125 


109.5 


4.0 


652 


4 


US-09-489-039A-12461 


Sequence 


12461, A 



126 


109 


3. 


9 


408 


2 


US-08-924-847A-2 


Sequence 


2, Appli 


127 


109 


3. 


9 


... 4 08 


3 


US-09-120-052-2 


Sequence 


2, Appli 


128 


108 


3. 


9 


3079 


5 


PCT-US94-00198-4 


Sequence 


4, Appli 


129 


107 


3. 


9 


372 


1 


US-08-196-218-33 


Sequence 


33 , Appl 


130 


107 


3. 


9 


372 


1 


US-08-681-953-33 


Sequence 


33, Appl 


131 


106.5 


3. 


8 


421 


2 


US-08-576-626A-53 


Sequence 


53, Appl 


132 


106 


3. 


8 


98 


5 


PCT-US92-00282-26 


Sequence 


26, Appl 


133 


105 


3. 


8 


408 


2 


US-08-926-258-2 


Sequence 


2, Appli 


134 


105 


3. 


8 


408 


2 


US-09-120-053-2 


Sequence 


2, Appli 


135 


105 


3. 


8 


408 


3 


US-09-198-212-2 


Sequence 


2, Appli 


136 


105 


3. 


8 


408 


4 


US-09-789-261-2 


Sequence 


2, Appli 


137 


.104 


3. 


8 


417 


4 


US-09-679-279-11 


Sequence 


11, Appl 


138 


103.5 


3. 


7 


426 


4 


US-09-540-236-2053 


Sequence 


2053, Ap 


139 


103 


3. 


7 


967 


3 


US-09-139-802-201 


Sequence 


201, App 


140 


103 


3. 


7 


967 


4 


US-09-659-786-201 


Sequence 


201, App 


141 


101. 5 


3. 


7 


436 


4 


US-09-679-279-4 


Sequence 


4, Appli 


142 


100 


3. 


6 


101 


4 


US-09-270-767-58100 


Sequence 


58100, A 


143 


100 


3. 


6 


388 


4 


US-09-602-787A-180 


Sequence 


180, App 


144 


100 


3. 


6 


455 


3 


US-09-036-987A-17 


Sequence 


17, Appl 


145 


100 


3. 


6 


455 


3 


US-09-370-700-17 


Sequence 


17, Appl 


146 


100 


3. 


6 


455 


4 


US-09-603-207-17 


Sequence 


17, Appl 


147 


98.5 


3. 


6 


391 


3 


US-09-134-001C-3952 


Sequence 


3952, Ap 


148 


98.5 


3. 


6 


834 


1 


US-07-977-434-8 


Sequence 


8, Appli 


149 


98.5 


3. 


6 


834 


1 


US-08-458-819-8 


Sequence 


8, Appli 


150 


98.5 


3. 


6 


834 


5 


PCT-US91-07035-8 


Sequence 


8, Appli 


151 


98 


3. 


5 


528 


4 


US-09-489-039A-13077 


Sequence 


13077, A 


152 


97.5 


3. 


5 


616 


4 


US-09-873-404-4 


Sequence 


4, Appli 


153 


97.5 


3. 


5 


616 


4 


US-10-243-735-4 


Sequence 


4, Appli 


154 


97 . 5 


3. 


5 


639 


4 


US-09-328-352-7786 


Sequence 


7786, Ap 


155 


96.5 


3. 


5 


608 


4 


US-09-284-768A-4 


Sequence 


4, Appli 


156 


96.5 


3. 


5 


991 


4 


US-09-24 8-796A-15239 


Sequence 


15239, A 


157 


96.5 


3. 


5 


1175 


4 


US-09-792-024-75 


Sequence 


75, Appl 


158 


95.5 


3. 


5 


1006 


4 


US-09-949-016-10730 . 


Sequence 


10730, A 


159 


95 


3. 


4 


72 


4 


US-09-270-767-61084 


Sequence 


61084, A 


160 


94. 5 


3. 


4 


187 


4 


US-09-270-767-41857 


Sequence 


41857, A 


161 


94.5 


3. 


4 


605 


4 


US-09-248-796A-19205 


Sequence 


19205, A 


162 


94.5 


3. 


4 


705 


4 


US-09-198-452A-68 


Sequence 


68, Appl 


163 


94.5 


3. 


4 


705 


4 


US-09-438-185A-50 


Sequence 


50, Appl 


164 


94.5 


3. 


4 


838 


4 


US-09-758-282B-265 


Sequence 


265, App 


165 


94.5 


3. 


4 


838 


4 


US-09-577-304A-265 


Sequence 


265, App 


166 


94 


3. 


4 


1199 


4 


US-09-134-000C-5542 


Sequence 


5542, Ap 


167 


93.5 


3. 


4 


491 


4 


US-09-489-039A-12234 


Sequence 


12234, A 


168 


93.5 


3. 


4 


669 


4 


US-09-252-991A-31488 


Sequence 


31488, A 


169 


93.5 


3. 


4 


1837 


4 


US-09-438-185A-98 


Sequence 


98, Appl 


170 


92.5 


3. 


3 


460 


1 


US-08-351-981-4 


Sequence 


4, Appli 


171 


92 


3. 


3 


452 


4 


US-09-284-768A-22 


Sequence 


22, Appl 


172 


92 


3. 


3 


480 


3 


US-09-182-859-4 


Sequence 


4, Appli 


173 


92 


3. 


3 


480 


3 


US-09-170-670-5 


Sequence 


5, Appli 


174 


92 


3. 


3 


480 


3 


US-09-193-068-5 


Sequence 


5, Appli 


175 


92 


3. 


3 


480 


3 


US-09-183-412-5 


Sequence 


5, Appli 


176 


92 


3. 


3 


480 


3 


US-09-290-734-5 


Sequence 


5, Appli 


177 


92 


3. 


3 


480 


4 


US-09-672-459-4 


Sequence 


4, Appli 


178 


92 


3. 


3 


480 


4 


US-09-545-586-5 


Sequence 


5, Appli 


179 


92 


3. 


3 


480 


4 


US-10-186-042-4 


Sequence 


4, Appli 


180 


92 


3. 


3 


480 


4 


US-09-769-864-5 


Sequence 


5, Appli 


181 


92 


3. 


3 


483 


2 


US-08-600-908A-13 


Sequence 


13, Appl 


182 


92 


3. 


3 


483 


3 


US-08-683-838A-13 


Sequence 


13, Appl 



183 


92 


3. 


3 


483 


3 


US- 


09- 


•291- 


023A-16 


Sequence 


16, Appl 


184 


92 


3. 


3 


483 


4 


US- 


09- 


537- 


168-6 


Sequence 


6, Appli 


185 


92 


3. 


3 


483 


4 


US- 


09- 


636- 


252A-13 


Sequence 


13 , Appl 


186 


92 


3. 


3 


483 


4 


US- 


09- 


•381- 


687-6 


Sequence 


6, Appli 


187 


92 


3. 


3 


483 


4 


us- 


09- 


•540- 


715A-16 


Sequence 


1 6 , Appl 


188 


92 


3. 


3 


. 514 


1 


us- 


08- 


720- 


•899-4 


Sequence 


A, Appli 


189 


92 


3. 


3 


514 


1 


us- 


08- 


459- 


610-4 


Sequence 


A, Appli 


190 


92 


3. 


3 


514 


2 


us- 


08- 


343- 


•804-4 


Sequence 


A, Appli 


191 


92 


3. 


3 


514 


2 


us- 


08- 


687- 


•399-4 


Sequence 


4, Appli 


192 


92 


3. 


3 


514 


2 


us- 


08- 


•600- 


•908A-4 


Sequence 


4, Appli 


193 


92 


3. 


3 


514 


3 


us~ 


08- 


683- 


838A-4 


Sequence 


4, Appli 


194 


92 


3. 


3 


514 


3 


us- 


09- 


264- 


097-4 


Sequence 


4, Appli 


195 


92 


3. 


3 


514 


4 


us- 


09- 


636- 


252A-4 


Sequence 


A, Appli 


196 


92 


3. 


3 


520 


1 


us- 


08- 


468- 


700-36 


Sequence 


36, Appl 


197 


92 


3. 


3 


520 


1 


us- 


08- 


645- 


971-4 


Sequence 


4 , Appli 


198 


92 


3. 


3 


520 


2 


us- 


08- 


468- 


220-34 


Sequence 


34, Appl 


199 


92 


3. 


3 


520 


2 


us- 


08- 


468- 


698-34 


Sequence 


34, Appl 


200 


92 


3. 


3 


520 


2 


us- 


08- 


704- 


706A-36 


Sequence 


36, Appl 


201 


92 


3. 


3 


520 


3 


us- 


08- 


890- 


383-5 


Sequence 


5, Appli 


202 


92 


3. 


3 


520 


3 


us- 


08- 


914- 


67 9A-5 


Sequence 


5, Appli 


203 


92 


3. 


3 


520 


3 


us- 


08- 


985- 


659-37 


Sequence 


31, Appl 


204 


92 


3. 


3 


520 


3 


us- 


08- 


194- 


664A-34 


Sequence 


34, Appl 


205 


92 


3. 


3 


520 


5 


PCT 


-US94-01553A-34 


Sequence 


34, Appl 


206 


92 


3. 


3 


520 


5 


PCT 


-US95-10426-34 


Sequence 


34, Appl 


207 


92 


3. 


3 


657 


4 


us- 


09- 


2'84- 


768A-7 


Sequence 


7, Appli 


208 


92 


3. 


3 


817 


4 


us- 


09- 


248- 


796A-17089 


Sequence 


17089, A 


209 


91.5 


3. 


3 


143 


4 


us- 


09- 


270- 




Sequence 


62394, A 


210 


91 


3. 


3 


1826 


4 


us- 


09- 


198- 


452A-113 


Sequence 


113, App 


211 


90.5 


3. 


3 


727 


4 


us- 


09- 


543- 


681A-6690 


Sequence 


6690, Ap 


212 


90.5 


3. 


3 


1153 


1 


us- 


08- 


097- 


997A-14 


Sequence 


14, Appl 


213 


90 


3. 


3 


272 


4 


us- 


09- 


902- 


540-14018 


Sequence 


14018, A 


214 


90 


3. 


3 


366 


3 


us- 


09- 


134- 


001C-3198 


Sequence 


3198, Ap 


215 


90 


3. 


3 


620 


4 


us- 


09- 


861- 


451A-34 


Sequence 


34, Appl 


216 


89.5 


3. 


2 


471 


4 


us~ 


09- 


051- 


961-7 


Sequence 


7, Appli 


217 


89 


3. 


2 


313 


4 


us- 


09- 


710- 


279-1216 


Sequence 


1216, Ap 


218 


89 


3. 


2 


357 


4 


us- 


09- 


710- 


279-410 


Sequence 


410, App 


219 


89 


3. 


2 


372 


4 


us- 


09- 


710- 


279-182 


Sequence 


182, App 


220 


88.5 


3. 


2 


197 


4 


us- 


09- 


270- 


767-32922 


Sequence 


32922, A 


221 


88.5 


3. 


2 


197 


4 


us- 


09- 


270- 


767-48139 


Sequence 


48139, A 


222 


88.5 


3. 


2 


726 


3 


us- 


09- 


126- 


980-2 


Sequence 


2, Appli 


223 


88.5 


3. 


2 


726 


3 


us- 


09- 


476- 


482-2 


Sequence 


2, Appli 


224 


88.5 


3. 


2 


726 


3 


us- 


09- 


517- 


605-6 


Sequence 


6, Appli 


225 


88 


3. 


2 


388 


4 


us- 


10- 


138- 


701-57 


Sequence 


57, Appl 


226 


88 


3. 


2 


540 


4 


us- 


09- 


248- 


796A-15935 


Sequence 


15935, A 


227 


88 


3. 


2 


745 


4 


us- 


09- 


543- 


681A-4267 


Sequence 


4267, Ap 


228 


88 


3. 


2 


838. 


4 


us- 


09- 


758- 


282B-261 


Sequence 


261, App 


229 


88 


3. 


2 


838 


4 


us- 


09- 


577- 


304A-261 


Sequence 


261, App 


230 


88 


3. 


2 


1153 


3 


us- 


08- 


665- 


574C-14 


Sequence 


14, Appl 


231 


88 


3. 


2 


1153 


3 


us- 


08- 


946- 


994-14 


Sequence 


14, Appl 


232 


88 


3. 


2 


1153 


4 


us- 


09- 


963- 


137-202 


Sequence 


202, App 


233 


87.5 


3. 


2 


460 


1 


us- 


08- 


•351- 


981-9 


Sequence 


9, Appli 


234 


87.5 


3. 


2 


729 


4 


us- 


09- 


949- 


016-10790 


Sequence 


10790, A 


235 


87.5 


3. 


2 


729 


4 


us- 


09- 


949- 


016-10791 


Sequence 


10791, A 


236 


87.5 


3. 


2 


1075 


4 


us- 


09- 


949- 


016-8308 


Sequence 


8308, Ap 


237 


87.5 


3. 


2 


1452 


2 


us- 


08- 


•449- 


•644-8 


Sequence 


8, Appli 


238 


87.5 


3. 


2 


1452 


2 


us- 


08- 


087- 


244A-8 


Sequence 


8, Appli 


239 


87 


3. 


1 


565 


2 


us- 


08- 


357- 


•533A-9 


Sequence 


9, Appli 



240 


87 


3. 


1 


565 


2 


us- 


08- 


459- 


009-9 


Sequence 


9, Appli 


241 


87 


3. 


1 


565 


3 


us- 


08- 


459- 


951-9 . 


Sequence 


9, Appli 


242 


87 


3. 


1 


944 


4 


us- 


09- 


107- 


532A-4864 


Sequence 


4864, Ap 


243 


86.5 


3. 


1 


542 


4 


us- 


09- 


198- 


452A-496 


Sequence 


496, App 


244 


86.5 


3. 


1 


696 


4 


us- 


09- 


438- 


185A-464 


Sequence 


464, App 


245 


86.5 


3. 


•1 


908 


4 


us- 


09- 


623- 


326-9 


Sequence 


9, Appli 


246 


86.5 


3. 


1 


1022 


1 


us- 


08- 


271- 


364A-8 


Sequence 


8, Appli 


247 


86.5 


3. 


1 


1022 


2 


us- 


08- 


222- 


715B-27 


Sequence 


27, Appl 


248 


86.5 


3. 


1 


1060 


4 


us- 


09- 


248- 


796A-16624 


Sequence 


16624, A 


249 


86.5 


3. 


1 


2777 


4 


us- 


09- 


543- 


681A-6124 


Sequence 


6124, Ap 


250 


86 


3. 


1 


197 


3 


us- 


09- 


813- 


918-4 


Sequence 


4, Appli 


251 


86 


3. 


1 


197 


4 


us- 


10- 


060- 


311-4 


Sequence 


4, Appli 


252 


86 


3. 


1 


268 


4 


us- 


09- 


371- 


338-21 


Sequence 


21, Appl 


253 


86 


3. 


1 


355 


4 


us- 


09- 


501- 


115-12 


Sequence 


12, Appl 


254 


86 


3 . 


1 


364 


4 


us- 


09- 


583- 


110-5235 


Sequence 


5235, Ap 


255 


86 


3. 


1 


690 


4 


us- 


09- 


371- 


338-19 


Sequence 


19, Appl 


256 


85.5 


3. 


1 


308 


4 


us- 


09- 


248- 


796A-14227 


Sequence 


14227, A 


257 


85.5 


3. 


1 


352 


2 


us- 


08- 


751- 


474-2 


Sequence 


2, Appli 


258 


85.5 


3. 


1 


352 


4 


us- 


09- 


583- 


110-2857 


Sequence 


2857, Ap 


259 


85.5 


3 . 


1 


355 


4 


us- 


09- 


107- 


433-4605 


Sequence 


4605, Ap 


260 


85.5 


3. 


1 ( 


389 


4 


us- 


09- 


134- 


000C-5382 


Sequence 


5382, Ap 


261 


85 . 5 


3. 


1 


493 


4 


us- 


09- 


925- 


637-24 


Sequence 


24, Appl 


262 


85.5 


3. 


1 


494 


3 


us- 


08- 


988- 


251-2 


Sequence 


2, Appli 


263 


85 . 5 


3. 


1 


494 


3 


us- 


09- 


386- 


048-2 


Sequence 


2, Appli 


264 


85. 5 


3. 


1 


617 


4 


us- 


09- 


252- 


991A-29507 


Sequence 


29507, A 


265 


85. 5 


3. 


1 


658 


3 


us- 


08- 


953- 


040-9 


Sequence 


9, Appli 


266 


85 . 5 


3. 


1 


898 


. 4 


us- 


09- 


489- 


039A-10322 


Sequence 


10322, A 


267 


85.5 


3. 


1 


930 


3 


us- 


08- 


953- 


040-2 


Sequence 


2, Appli 


268 


85.5 


3. 


1 


930 


4 


us- 


09- 


583- 


110-3208 


Sequence 


3208, Ap 


269 


85.5 


3 . 


1 


939 


4 


us- 


09- 


107- 


433-4543 


Sequence 


4543, Ap 


270 


85 


3. 


1 


893 


1 


us- 


07- 


977- 


434-4 


Sequence 


4, Appli 


271 


85 


3 . 


1 


893 


1 


us- 


08- 


458- 


819-4 


Sequence 


4, Appli 


272 


85 


3. 


1 


893 


3 


us- 


09- 


105- 


697-10 


Sequence 


10, Appl 


273 


85 


3 . 


1 


893 


3 


us- 


09- 


514- 


302-4 


Sequence 


4, Appli 


274 


85 


3 . 


1 


893 


4 


us- 


09- 


623- 


326-18 


Sequence 


18, Appl 


275 


85 


3. 


1 


893 


4 


us- 


10- 


014- 


436-4 


Sequence 


4, Appli 


276 


85 


3. 


1 


893 


5 


PCT 


'-US91-07035-4 


Sequence 


4 , Appli 


277 


85 


3. 


1 


1247 


4 


us- 


09- 


134- 


000C-4968 


Sequence 


4968, Ap 


278 


85 


3. 


1 


1540 


4 


us- 


09- 


949- 


016-11382 


Sequence 


11382, A 


279 


85 


3. 


1 


1540 


4 


us- 


09- 


949- 


016-11383 


Sequence 


11383, A 


280 


85 


3. 


1 


1719 


2 


us- 


08- 


•459- 


568-4 


Sequence 


4, Appli 


281 


85 


3. 


1 


1719 


2 


us- 


08- 


399- 


•411-4 


Sequence 


4, Appli 


282 


85 


3. 


1 


1719 


3 


us- 


•08- 


•516- 


•859A-4 


Sequence 


4, Appli 


283 


85 


3. 


1 


1719 


3 


us- 


09- 


586- 


472-4 


Sequence 


4 , Appli 


284 


85 


3. 


1 


1719 


4 


us- 


09- 


528- 


706-4 


Sequence 


4, Appli 


285 


85 


3. 


1 


1938 


3 


us- 


09-. 


•5.14- 


302-2 


Sequence 


2 , Appli 


286 


85 


3 . 


1 


1938 


4 


us- 


10- 


•014- 


436-2 


Sequence 


2, Appli 


287 


84 . 5 


3. 


1 


319 


1 


us- 


•08- 


•597- 


236-7 


Sequence 


7, Appli 


288 


84.5 


3. 


1 


319 


1 


us- 


•08- 


•746- 


•682A-7 


Sequence 


7, Appli 


289 


84.5 


3. 


1 


447 


3 


us- 


09- 


-305- 


•001-2 


Sequence 


2, Appli 


290 


84.5 


3. 


1 


447 


4 


us- 


•09- 


•583- 


■110-3287 


Sequence 


3287, Ap 


291 


84.5 


3. 


1 


451 


4 


us- 


•09- 


•107- 


■433-3636 


Sequence 


3636, Ap 


292 


84.5 


3. 


1 


471 


4 


us- 


•09- 


•538- 


•092-938 


Sequence 


938, App 


293 


84.5 


3. 


1 


481 


4 


us- 


•09- 


•673- 


•395A-277 


Sequence 


277, App* 


294 


84.5 


3. 


1 


489 


4 


us- 


•09- 


•949- 


•016-7030 


Sequence 


7030, Ap 


295 


84.5 


3. 


1 


492 


3 


us- 


•08- 


•984- 


•618-2 


Sequence 


2, Appli 


296 


84.5 


3. 


1 


635 


4 


us- 


: 10- 


•101- 


•464A-932 


Sequence 


932, App 



297 


84. 5 


3. 


1 
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6, Appli 


679 


75.5 


2.7 


834 


2 


US-08-823-516-6 


Sequence 


6, Appli 


680 


75.5 


2.7 


834 


3 


US-08-682-853A-6 


Sequence 


6, Appli 


681 


75.5 


2.7 


834 


3 


US-08-759-038-6 


Sequence 


6, Appli 


682 


75.5 


2.7 


834 


3 


US-08-758-314-6 


Sequence 


6, Appli 


683 


75.5 


2.7 


834 


3 


US-09-350-309-6 


Sequence 


6, Appli 


684 


75.5 


2.7 


834 


3 


US-08-520-94.6-6 


Sequence 


6, Appli 


685 


75.5 


2.7 


834 


4 


US-09-684-938-6 


Sequence 


6, Appli 


686 


75.5 


2.7 


834 


4 


US-09-308-825A-6 


Sequence 


6, Appli 


687 


75.5 


2.7 


834 


4 


US-09-758-282B-267 


Sequence 


267, App 


688 


75.5 


2.7 


834 


4 


US-09-655-378A-6 


Sequence 


6, Appli 


689 


75.5 


2.7 


834 


4 


US-09-940-244-6 


Sequence 


6, Appli 


690 


75.5 


2.7 


834 


4 


US-09-333-145-6 


Sequence 


6, Appli 


691 


75.5 


2.7 


834 


4 


US-09-577-304A-267 


Sequence 


267, App 


692 


75.5 


2.7 


834 


5 


PCT-US91-07035-10 


Sequence 


10, Appl 


693 


75.5 


2.7 


838 


3 


US-09-235-451-2 


Sequence 


2, Appli 


694 


75.5 


2.7 


838 


4 


US-09-132-316-3 


Sequence 


3, Appli 


695 


75.5 


2.7 


838 


4 


US-09-667-422-9 


Sequence 


9, Appli 



696 


75 


.5 


2. 


7 


838 


4 


us- 


09- 


978- 


303-2 


Sequence 


2, Appli 


697 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


758- 


282B-72 


Sequence 


72, Appl 


698 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


758- 


282B-91 


Sequence 


91, Appl 


699 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


758- 


282B-160 


Sequence 


160, App 


700 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


758- 


282B-163 


Sequence 


163, App 


701 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


758- 


282B-166 


Sequence 


166, App 


702 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


758- 


282B-212 


Sequence 


212, App 


703 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


758- 


282B-214 


Sequence 


214, App 


704 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


758- 


282B-216 


Sequence 


216, App 


705 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


577- 


304A-72 


Sequence 


72, Appl 


706 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


577- 


304A-91 


Sequence 


91, Appl 


707 


75 


. 5 


2 . 


7 


842 


4 


us- 


09- 


577- 


304A-160 


Sequence 


160, App 


708 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


577- 


304A-163 


Sequence 


163, App 


709 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


577- 


304A-166 


Sequence 


166, App 


710 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


577- 


304A-212 


Sequence 


212, App 


711 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


577- 


304A-214 


Sequence 


214, App 


712 


75 


.5 


2. 


7 


842 


4 


us- 


09- 


577- 


304A-216 


Sequence 


216, App 


713 


75 


.5 


2 . 


7 


842 


4 


us~ 


09- 


777- 


430C-20 


Sequence 


20, Appl 


714 


75.5 


2 . 


7 


842 


4 


us- 


09- 


777- 


•430C-23 


Sequence 


23, Appl 


715 


75 


.5 


2 . 


7 


842 


4 


us- 


09- 


777- 


•430C-26 


Sequence 


26, Appl 


716 


75 


.5 


2 . 


7 


915 


1 


us- 


08- 


346- 


■455B-69 


Sequence 


69, Appl 


717 


75 


.5 


2 . 


7 


915 


3 


us- 


08- 


977- 


•221-69 


Sequence 


69, Appl 


718 


75 


.5 


2. 


7 


915 


4 


us- 


09- 


483- 


■831B-69 


Sequence 


69, Appl 


719 


75 


.5 


2 . 


7 


915 


5 


PCT 


'-US95-06613-69 


Sequence 


69, Appl 


720 


75 


.5 


2\ 


7 


1207 


4 


us- 






-SQ4-4 

+J U *1 M 


Sequence 


4, Appli 


721 


75 


.5 


2 . 


7 


1646 


4 


us- 






008-67 


Sequence 


67, Appl 


722 


75 


.5 


2 . 


7 


1647 


4 


us- 








Sequence 


2, Appli 


723 


75 


.5 


2. 


7 


1647 


4 


us- 


09- 


•824- 


■574-4 


Sequence 


4, Appli 


724 


75 


.5 


2. 


7 


1647 


4 


us- 


•09- 


•538- 


-092-1172 


Sequence 


1172, Ap 


725 


75 


.5 


2. 


7 


1649 


4 


us- 


09- 


-535- 


-008-75 


Sequence 


75, Appl 


726 


75 


.5 


2. 


7 


1650 


4 


us- 


09- 


-535- 


-008-71 


Sequence 


71, Appl 


727 


75 


.5 


2. 


7 


1659 


4 


us- 


•09- 


-949- 


-016-9752 


Sequence 


9752, Ap 


728 


75 


.5 


2.7 


1678 


4 


us- 


•09- 


-535- 


-008-69 


Sequence 


69, Appl 


729 


75 


.5 


2. 


7 


1679 


4 


us- 


•09- 


-535- 


-008-65 


Sequence 


65, Appl 


730 


75.5 


2. 


7 


1681 


4 


us- 


•09- 


-535- 


-008-77 


Sequence 


77, Appl 


731 


75 


.5 


2. 


7 


1682 


4 


us- 


■09- 


-535- 


-008-73 


Sequence 


73, Appl 


732 


75 


.5 


2. 


7 


2057 


4 


us- 


-09- 


-499- 


-203-2 


Sequence 


2, Appli 


733 




75 


2. 


7 


205 


4 


us- 


•09- 


-248- 


-796A-22069 


Sequence 


22069, A 


734 




75 


2. 


7 


326 


4 


us- 


-09- 


-302- 


-626B-54 


Sequence 


54, Appl 


735 




75 


2. 


7 


357 


4 


us- 


-09- 


-198- 


-452A-977 


Sequence 


977, App 


736 




75 


2. 


7 


359 


4 


us- 


■09- 


-438- 


-185A-906 


Sequence 


906, App 


737 




75 


2. 


7 


362 


4 


us- 


-09- 


-248- 


-796A-20514 


Sequence 


20514, A 


738 




75 


2. 


,7 


372 


4 


us- 


-09- 


-538- 


-092-1035 


Sequence 


1035, Ap 


739 




75 


2. 


,7 


414 


4 


us- 


-09- 


-583- 


-110-4310 


Sequence 


4310, Ap 


740 




75 


2. 


,7 


415 


4 


us- 


-09- 


-107- 


-433-3943 


Sequence 


3943, Ap 


.7.41 




75 


2. 


,7 


418 


4 


us- 


-09- 


-248- 


-796A-18314.. . 


Sequence 


18314, A 


742 




75 


2, 


,7 


453 


3 


us- 


-09- 


-134- 


-001C-3749 


Sequence 


3749, Ap 


743 




75 


2, 


.7 


459 


4 


us- 


-09- 


-248- 


-796A-19036 


Sequence 


19036, A 


744 




75 


2. 


.7 


473 


4 


us- 


-09- 


-489- 


-039A-9854 


Sequence 


9854, Ap 


745 




75 


2. 


.7 


489 


4 


us- 


-09- 


-949- 


-016-9590 


Sequence 


9590, Ap 


746 




75 


2, 


.7 


491 


4 


us- 


-09- 


-710- 


-279-2808 


Sequence 


2808, Ap 


747 




75 


2, 


.7 


522 


1 


us- 


-08- 


-680- 


-726A-58 


Sequence 


58, Appl 


748 




75 


2, 


.7 


522 


3 


us- 


-09- 


-092- 


-409-58 


Sequence 


58, Appl 


749 




75 


2 


.7 


525 


3 


us- 


-08- 


-984- 


-618-17 


Sequence 


17, Appl 


750 




75 


2 


.7 


540 


3 


us- 


-08- 


-964- 


-268-6 


Sequence 


6, Appli 


751 




75 


2 


.7 


540 


4 


us- 


-09- 


-105- 


-254-6 


Sequence 


6, Appli 


752 




75 


2 


.7 


585 


4 


us- 


-09- 


-489- 


-039A-10279 


Sequence 


10279, A 



753 


75 


2.7 


776 


1 


US-08-021-601-2 


Sequence 


2, Appli 


754 


75 . 


2.7 


776 


1. 


US-08-082-.849B-2 


Sequence 


2, Appli 


755 


75 


2.7 


776 


5 


PCT-US94-01624-2 


Sequence 


2, Appli 


756 


75 


2.7 


831 


1 


US-08-073-384C-5 


Sequence 


5, Appli 


757 


75 


2.7 


831 


1 


US-08-254-359A-5 


Sequence 


5, Appli 


758 


75 


2.7 


831 


1 


US-08-483-043-5 


Sequence 


5, Appli 


759 


75 


2.7 


831 


1 


US-08-481-238-5 


Sequence 


5, Appli 


760 


75 


2.7 


831 


2 


US-08-471-066B-5 


Sequence 


5, Appli 


761 


75 


2.7 


831 


2 


US-08-484-956-5 


Sequence 


5, Appli 


7 62 


75 


2.7 


831 


2 


US-08-757-653-5 


Sequence 


5, Appli 


763 


75 


2.7 


831 


2 


US-08-599-491-5 


Sequence 


5, Appli 


764 


75 


2.7 


831 


2 


US-08-756-386-5 


Sequence 


5, Appli 


765 


75 


2.7 


831 


2 


US-08-823-516-5 


Sequence 


5, Appli 


766 


75 


2.7 


831 


3 


US-08-682-853A-5 


Sequence 


5, Appli 


767 


75 


2.7 


831 


3 


US-08-759-038-5 


Sequence 


5, Appli 


768 


75 


2.7 


831 


3 


US-08-758-314-5 


Sequence 


5, Appli 


769 


75 


2.7 


831 


3 


US-09-350-309-5 


Sequence 


5, Appli 


770 


75 


2.7 


831 


3 


US-08-520-946-5 


Sequence 


5, Appli 


771 


75 


2.7 


831 


4 


US-09-684-938-5 


Sequence 


5, Appli 


772 


75 


2.7 


831 


4 


US-09-308-825A-5 


Sequence 


5, Appli 


773 


75 


2.7 


831 


' 4 


US-09-758-282B-5 


Sequence 


5, Appli 


774 


75 


2.7 


831 


4 


US-09-655-378A-5 


Sequence 


5, Appli 


775 


75 


2.7 


831 


4 


US-09-940-244-5 


Sequence 


5, Appli 


776 


75 


2.7 


831 


4 


US-09-333-145-5 


Sequence 


5, Appli 


777 


75 


2.7 


831 


4 


US-09-577-304A-5 


Sequence 


5, Appli 


778 


75 


2.7 


871 


4 


US-09-792-024-81 


Sequence 


81 , Appl 


779 


75 


2.7 


894 


1 


US-08-117-362-4 


Sequence 


4, Appli 


780 


75 


2.7 


894 


1 


US-08-486-924-4 


Sequence 


4, Appli 


781 


75 


2.7 


894 


4 


US-08-486-929A-4 


Sequence 


4 , Appli 


782 


75 


2.7 


990 


2 


US-08-645-193B-15 


Sequence 


15 , Appl 


783 


75 


2.7 


1164 


3 


US-09-457-708-2 


Sequence 


2, Appli 


784 


75 


2.7 


1164 


4 


US-09-950-046A-2 


Sequence 


2, Appli 


785 


75 


2.7 


1164 


4 


US-09-976-594-989 


Sequence 


989, App 


786 


75 


2.7 


1228 


4 


US-09-94 9-016-6805 


Sequence 


6805, Ap 


787 


75 


2.7 


1729 


4 


US-09-134-000C-5675 


Sequence 


5675, Ap 


788 


75 


2.7 


1990 


4 


US-09-902-540-11251 


Sequence 


11251, A 


789 


74 . 5 


2.7 


183 


4 


US-09-270-767-32353 


Sequence 


32353, A 


790 


74 . 5 


2.7 


183 


4 


US-09-27 0-767-47570 


Sequence 


47570, A 


791 


74 . 5 


2.7 


311 


4 


US-09- 14 8-545- 191 


Sequence 


191, App 


792 


74.5 


2.7 


312 


3 


US-09-424-349A-5 


Sequence 


5, Appli 


793 


74 . 5 


2.7 


359 


4 


US-10-029-180-68 


Sequence 


68, Appl 


794 


74.5 


2.7 


366 


4 


US-09-501-115-10 


Sequence 


10, Appl 


795 


74.5 


2.7 


380 


4 


US-09-198-452A-62 6 


Sequence 


626, App 


796 


74. 5 


2.7 


380 


4 


US-09-438-185A-586 


Sequence 


586, App 


797 


74.5 


2.7 


' 386 


4 


US-09-198-452A-1046 


Sequence 


1046, Ap 


798., 


74.5 


2.7 


394 


4 


US-09-^38-185A-974 


. — . Sequence 


974, App 


799 


74.5 • 


2.7 


398 


4 


US-09-902-54 0-12305 


Sequence 


12305, A 


800 


74.5 


2.7 


398 


6 


5187089-7 


Patent No 


. 5187089 


801 


74.. 5 


2.7 


398 


6 


5187089-7 


Patent No 


. 5187089 


802 


74.5 


2.7 


399 


3 


US-09-347-801-14 


Sequence 


14, Appl 


803 


74.5 


2.7 


399 


4 


US-09-854-731-14 


Sequence 


14, Appl 


804 


74.5 


2.7 


409 


2 


US-08-924-254-2 


Sequence 


2, Appli 


805 


74.5 


.2.7 


409 . 


3 


US-09-120-249-2 


Sequence 


2, Appli 


806 


74.5 


2.7 


411 


3 


US-09-347-801-22 


Sequence 


22, Appl 


807 


74.5 


2.7 


411 


4 


US-09-854-731-22 


Sequence 


22, Appl 


808 


74.5 


2.7 


412 


3 


US-09-347-801-21 


Sequence 


21, Appl 


809 


74.5 


2.7 


412 


4 


US-09-854-731-21 


Sequence 


21, Appl 



810 


74.5 


2 


.7 


431 


2 


US-08-712-709-5 


Sequence 


5, Appli 


811 


74.5 


2 


.7 


.431 


3 


US-09-111-444-5 


Sequence 


5, Appli 


812 


74.5 


2 


.7 


431 


3 


US-09-541-228-5 


Sequence 


5, Appli 


813 


74.5 


2 


.7 


431 


3 


US-09-031-295-2 


Sequence 


2, Appli 


814 


74.5 


2 


.7 


434 


4 


US-09-543-681A-4577 


Sequence 


4577, Ap 


815 


74.5 


2 


.7 


440 


4 


US-10-029-180-124 


Sequence 


124, App 


816 


74.5 


2 


.7 


451 


4 


US-10-029-180-123 


Sequence 


123, App 


817 


74.5 


2 


.7 


458 


4 


US-09-583-110-5167 


Sequence 


5167, Ap 


818 


74.5 


2 


.7 


554 


1 


US-08-106-761-2 


Sequence 


2, Appli 


819 


74.5 


2 


.7 


554 


4 


US-08-909-125-7 


Sequence 


7, Appli 


820 


74.5 


2 


.7 


591 


4 


US-09-543-681A-5171 


Sequence 


5171, Ap 


821 


74.5 


2 


.7 


612 


4 


US-09-248-796A-18848 


Sequence 


18848, A 


822 


74.5 


2 


.7 


656 


4 


US-09-489-039A-8212 


Sequence 


8212, Ap 


823 


74.5 


2 


.7 


720 


4 


US-09-583-110-2940 


Sequence 


2940, Ap 


824 


74.5 


2 


.7 


720 


4 


US-09-107-433-4193 


Sequence 


4193, Ap 


825 


74.5 


2 


.7 


726 


4 


US-09-417-197-71 


Sequence 


71, Appl 


826 


74.5 


2 


.7 


790 


2 


US-08-359-705B-9 


Sequence 


9, Appli 


827 


74.5 


2 


.7 


790 


2 


US-08-286-846A-9 


Sequence 


9, Appli 


828 


74.5 


2 


.7 


790 


2 


US-08-457-880A-9 


Sequence 


9, Appli 


829 


74.5 


2 


.7 


790 


3 


US-08-444-622A-9 


Sequence 


9, Appli 


830 


74.5 


2 


.7 


790 


3 


US-08-942-562-9 


Sequence 


9, Appli 


831 


74.5 


2 


.7 


790 


3 


US-09-156-923-9 


Sequence 


9, Appli 


832 


74.5 


2 


.7 


814 


1 


US-08-286-305A-3 


Sequence 


3, Appli 


833 


74.5 


2 


.7 


814 


2 


US-08-441-104A-3 


Sequence 


3, Appli 


834 


74.5 


2 


.7 


814 


2 


US-08-440-816A-3 


Sequence 


3, Appli 


835 


74.5 


2 


.7 


814 


3 


US-09-417-381A-3 


Sequence. 


3, Appli 


836 


74.5 


2 


.7 


834 


3 


US-09-143-571-29 


Sequence 


29, Appl 


837 


74.5 


2 


.7 


834 


4 


US-09-470-276-2 


Sequence 


2, Appli 


838 


74 . 5 


2 


.7 


855 


4 


US-09-949-016-11016 


Sequence 


11016, A 


839 


74.5 


2 


.7 


859 


4 


US-09-538-092-206 


Sequence 


206, App 


840 


74 . 5. 


2 


.7 


1071 


2 


US-08-975-527-1 


Sequence 


1, Appli 


841 


74.5 


2 


.7 


1489 


4 


US-09-538-092-304 


Sequence 


304, App 


842 


74.5 


2 


.7 


2262 


4 


US-09-949-016-8849 


Sequence 


8849, Ap 


843 


74 


2 


.7 


397 


6 


5187089-9 


Patent No. 


, 5187089 


844 


74 


2 


.7 


397 


6 


5457090-4 


Patent No. 


, 5457090 


845 


74 


2 


.7 


397 


6 


5187089-9 


Patent No, 


, 5187089 


846 


74 


2 


.7 


397 


6 


5457090-4 


Patent No. 


. 5457090 


847 


74 


2 


.7 


427 


3 


US-09-134-001C-5397 


Sequence 


5397, Ap 


848 


74 


2 


.7 


430 


4 


US-09-198-452A-497 


Sequence 


497, App 


849 


74 


2 


.7 


430 


4 


US-09-438-185A-465 


Sequence 


4 65, App 


850 


74 


2 


.7 


443 


4 


US-09-134-000C-4800 


Sequence 


4800, Ap 


851 


74 


2 


.7 


453 


4 


US-09-769-863-14 


Sequence 


14, Appl 


852 


74 


2 


.7 


467 


4 


US-09-107-532A-6186 


Sequence 


6186, Ap 


853 


74 


2 


.7 


474 


4 


US-09-248-796A-15747 


Sequence 


15747, A 


854 


74 


2 


.7 


481 


2 


US-08-477-451-19 


Sequence 


19, Appl 


855 


74 


. 2 


.7 


501 


4 


US-09-538-092-692 


Sequence 


692, App 


856 


74 


2 


.7 


528 


4 


US-09-710-279-1930 


Sequence 


1930, Ap 


857 


74 


2 


.7 


557 


4 


US-09-583-110-4748 


Sequence 


4748, Ap 


858 


74 


2 


.7 


573 


4 


US-09-107-433-3736 


Sequence 


3736, Ap 


859 


74 " 


2 


.7 


584 


4 


US-09-604-957-6 


Sequence 


6, Appli 


860 


74 


2 


.7 


610 


4 


US-09-248-796A-18471 


Sequence 


18471, A 


861 


74 


2 


.7 


615 


4 


US-09-388-743-2 


Sequence 


2, Appli 


862 
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Sequence 


7, Appli 


1206 


71 


2 


.6 


709 


1 


US-08-328-809-2 


Sequence 


2, Appli 


1207 


71 


2 


.6 


709 


3 


US-09-015-003-2 


Sequence 


2, Appli 


1208 


71 


2 


.6 


709 


4 


US-08-866-840-2 


Sequence 


2, Appli 



1209 


71 


2. 


6 


709 


4 


US-09-489-039A-11018 


Sequence 


11018, A 


1210 


71 


2. 


6 


709 


4 


US-09-538-092-1287 


. Sequence 


1287, Ap 


1211 


71 


2. 


6 


709 


5 


PCT-US92-11107-7 


Sequence 


7, Appli 


1212 


71 


2. 


6 


714 


4 


US-09-543-681A-6809 


Sequence 


6809, Ap 


1213 


71 


2. 


6 


719 


4 


US-09-949-016-9561 


Sequence 


9561, Ap 


1214 


71 


2. 


6 


749 


4 


US-09-949-016-10583 


Sequence 


10583, A 


1215 


71 


2. 


6 


785 


4 


US-09-079-030-216 


Sequence 


216, App 


1216 


71 


2. 


6 


803 


2 


US-08-907-166-4 


Sequence 


4, Appli 


1217 


71 


2. 


6 


803 


4 


US-09-391-340-4 


Sequence 


4, Appli 


1218 


71 


2. 


6 


839 


4 


US-09-758-282B-234 


Sequence 


234, App 


1219 


71 


2. 


6 


839 


4 


US-09-577-304A-234 


Sequence 


234, App 


1220 


71 


2. 


6 


917 


4 


US-09-248-796A-19347 


Sequence 


19347, A 


1221 


71 


2. 


6 


1132 


4 


US-09-528-784A-87 


Sequence 


87, Appl 


1222 


71 


2. 


6 


1132 


4 


US-09-569-098A-87 


Sequence 


87, Appl 


1223 


71 


2. 


6 


1285 


2 


US-08-540-406-6 


Sequence 


6, Appli 


1224 


71 


2. 


6 


1285 


3 


US-08-656-055-6 


Sequence 


6, Appli 


1225 


71 


2. 


6 


1285 


3 


US-08-954-668-6 


Sequence 


6, Appli 


1226 


71 


2. 


6 


1285 


4 


US-08-918-658-6 


Sequence 


6, Appli 


1227 


71 


2 . 


6 


1285 


4 


US-09-724-631-6 


Sequence 


6, Appli 


1228 


71 


2. 


6 


1285 


4 


US-08-954-701A-6 


Sequence 


6, Appli 


1229 


71 


2 . 


6 


1285 


5 


PCT-US95-13233-6 


Sequence 


6, Appli 


1230 


71 


2 . 


6 


1440 


3 


US-09-357-251-37 


Sequence 


37, Appl 


1231 


71 


2 . 


6 


2516 


4 


US-09-94 9-016-10280 


Sequence 


10280, A 


1232 


71 


2 . 


6 


3056 


1 


US-0 8-50 8-8 36A-8 


Sequence 


8, Appli 


1233 


71 


2 . 


6 


3056 


2 


US-08-629-001A-3 


Sequence 


3, Appli 


1234 


71 


2 . 


6 


3056 


2 


US-08-874-266-2 


Sequence 


2, Appli 


1235 


71 


2. 


6 


3056 


3 


US-08-642-274D-3 


Sequence 


3, Appli 


1236 


71 


2. 


6 


3056 


3 


US-08-952-127-3 


Sequence 


3, Appli 


1237 


71 


2 . 


6 


3056 


3 


US-08-952-014C-3 


Sequence 


3, Appli 


1238 


71 


2. 


6 


3056 


4 


US-09-360-416-2 


Sequence 


2, Appli 


1239 


71 


2 . 


6 


3056 


4 


US-08-984-090-2 


Sequence 


2, Appli 


1240 


71 


2 . 


6 


3057 


4 


US-09-360-416-3 


Sequence 


3, Appli 


1241 


71 


2 . 


6 


3340 


4 


US-09-252-991A-23568 


Sequence 


23568, A 


1242 


71 


2 . 


6 


3559 


4 


US-09-693-205A-10 


Sequence 


10, Appl 


1243 


71 


2 . 


6 


15281 


2 


US-08-471-119A-2 


Sequence 


2, Appli 


1244 


70.5 


2. 


5 


211 


4 


US-09-322-409-78 


Sequence 


78, Appl 


1245 


70.5 


2. 


5 


211 


4 


US-09-451-527-78 


Sequence 


78, Appl 


1246 


70.5 


2 . 


5 


260 


4 


US-09-322-409-73 


Sequence 


73, Appl 


1247 


70.5 


2 . 


5 


260 


4 


US-09-45 1-527-7 3 


Sequence 


73, Appl 


1248 


70.5 


2 . 


5 


273 


4 


US-09-32 8-352-5843 


Sequence 


5843, Ap 


1249 


70.5 


2 . 


5 


273 


4 


US-08-956-171E-5196 


Sequence 


5196, Ap 


1250 


70.5 


2 . 


5 


273 


4 


US-09-710-279-752 


Sequence 


752, App 


1251 


70.5 


2. 


5 


273 


4 


US-08-781-986A-5196 


Sequence 


5196, Ap 


1252 


70.5 


2. 


5 


279 


4 


US-09-328-352-5581 


Sequence 


5581, Ap 


1253 


70.5 


2 . 


5 


291 


4 


US-09-107-532A-6097 


Sequence 


6097, Ap 


1254 


70.5 


2. 


5 


296 


4 


.....US- 09-54 0-236-2 650 


Sequence 


2650, Ap 


1255 


70.5 


2 . 


5 


325 


4 


US-09-7 10-279-3296 


Sequence 


3296, Ap 


1256 


70.5 


2 . 


5 


332 


4 


US-09-561-763-5 


Sequence 


5, Appli 


1257 


70.5 


2. 


5 


332 


4 


US-09-431-367B-5 


Sequence 


5, Appli 


1258 


70.5 


2. 


5 


333 


3 


US-09-134-001C-5204 


Sequence 


5204, Ap 


1259 


70.5 


2. 


5 


333 


4 


US-09-583-110-3052 


Sequence 


3052, Ap 


1260 


70.5 


2. 


5 


375 


4 


US-09-107-532A-5089 


Sequence 


5089, Ap 


1261 


70.5 


2. 


5 


390 


3 


US-08-460-576-2 


Sequence 


2, Appli 


1262 


70.5 


2. 


5 


390 


4. 


US-09-134-000O6192 


Sequence 


6192, Ap 


1263 


70.5 


2. 


5 


393 


4 


US-09-270-767-42793 


Sequence 


42793, A 


1264 


70.5 


2. 


5 


421 


4 


US-09-470-175-2 


Sequence 


2, Appli 


1265 


70.5 


2. 


5 


429 


4 


US-09-252-991A-22564 


Sequence 


22564, A 



1266 


70 


.5 


2. 


5 


433 


4 


us- 


09- 


252- 


991A-31231 


Sequence 


31231, A 


-1267 


70 


.5 


2. 


5 


436 


3 


us- 


08- 


660- 


347-2 


Sequence 


2, Appli 


1268 


70 


.5 


2. 


5 


444 


4 


us- 


09- 


252- 


991A-31017 


Sequence 


31017, A 


1269 


70 


.5 


2. 


5 


471 


4 


us- 


09- 


949- 


016-8344 


Sequence 


8344, Ap 


1270 


70 


.5 


2. 


5 


484 


4 


us- 


09- 


710- 


279-2498 


Sequence 


2498, Ap 


1271 


70 


.5 


2. 


5 


485 


2 


us- 


08- 


749- 


391-2 


Sequence 


2, Appli 


1272 


70 


.5 


2. 


5 


485 


3 


us- 


09- 


390- 


200-2 


Sequence 


2, Appli 


1273 


70 


.5 


2. 


5 


509 


4 


us- 


09- 


902- 


540-12233 


Sequence 


12233, A 


1274 


70 


.5 


2. 


5 


513 


4 


us- 


09- 


248- 


796A-20548 


Sequence 


20548, A 


1275 


70 


.5 


2. 


5 


524 


3 


us- 


08- 


957- 


302A-2 


Sequence 


2, Appli 


1276 


70 


.5 


2. 


5 


524 


3 


us- 


09- 


542- 


403-2 


Sequence 


2, Appli 


1277 


70 


.5' 


2. 


5 


529 


1 


us- 


08- 


548- 


509-2 


Sequence 


2, Appli 


1278 


70 


.5 


2. 


5 


533 


4 


us- 


09- 


328- 


352-8174 


Sequence 


8174, Ap 


1279 


70 


.5 


2. 


5 


538 


4 


us- 


09- 


710- 


279-260 


Sequence 


260, App 


1280 


70 


.5 


2. 


5 


542 


4 


us- 


09- 


107- 


532A-4858 


Sequence 


4858, Ap 


1281 


70 


.5 


2. 


5 


551 


3 


us- 


08- 


699- 


103B-25 


Sequence 


25, Appl 


1282 


70 


.5 


2. 


5 


551 


3 


us- 


09- 


229- 


059-25 


Sequence 


25, Appl 


1283 


70 


.5 


2. 


5 


551 


4 


us- 


09- 


628- 


133-25 


Sequence 


25, Appl 


1284 


70 


.5 


2. 


5 


560 


2 


us- 


08- 


756- 


317-11 


Sequence 


11, Appl 


1285 


70 


.5 


2. 


5 


567 


3 


us- 


09- 


134- 


001C-5646 


Sequence 


5646, Ap 


1286 


70 


.5 


2. 


5 


631 


4 


us- 


09- 


134- 


000C-6175 


Sequence 


6175, Ap 


1287 


70 


.5 


2. 


5 


661 


4 


us- 


09- 


438- 


185A-803 


Sequence 


803, App 


1288 


70 


.5 


2. 


5 


672 


4 


us- 


09- 


270- 


767-46242 


Sequence 


46242, A 


1289 


70 


.5 


2. 


5 


681 


4 


us- 


09- 


252- 


991A-25690 


Sequence 


25690, A 


1290 


70 


.5 


2. 


5 


686 


3 


us- 


09- 


•245- 


041-13 


Sequence 


13, Appl 


1291 


70 


.5 


2. 


5 


686 


4 


us- 


09- 


358- 


055B-13 


Sequence 


13, Appl 


1292 


70 


.5 


2. 


5 


686 


4 


us- 


09- 


893- 


■238-13 


Sequence 


13, Appl 


1293 


70 


.5 


2. 


5 


765 


4 


us- 


09- 


975- 


•326-4 


Sequence 


4, Appli 


1294 


70 


.5 


2. 


5 


765 


4 


us- 


10- 


217- 


■357-4 


Sequence 


4, Appli 


1295 


70 


.5 


2. 


5 


766 


4 


us- 


09- 


-975- 


■326-2 


Sequence 


2, Appli 


1296 


70 


.5 


2. 


5 


766 


4 


us- 


•10- 


•217- 


•357-2 


Sequence 


2, Appli 


1297 


70 


.5 


2. 


5 


818 


4 


us- 


■09- 


•134- 


■000C-5599 


Sequence 


5599, Ap 


1298 


70 


.5 


2. 


5 


839 


4 


us- 


09- 


-538- 


•092-274 


Sequence 


274, App 


1299 


70 


.5 


2. 


5 


888 


4 


us- 


09- 


•252- 


•991A-17967 


Sequence 


17967, A 


1300 


70 


.5 


2. 


5 


1027 


4 


us- 


•09- 


•902- 


•540-11750 


Sequence 


11750, A 


1301 


70 


• 5 


2. 


5 


1065 


4 


us- 


•09- 


•710- 


•279-1212 


Sequence 


1212, Ap 


1302 


70 


.5 


2. 


5 


1067 


3 


us- 


■09- 


•134- 


-001C-3800 


Sequence' 


3800, Ap 


1303 


70 


.5 


2. 


5 


1068 


3 


us- 


•09- 


-085- 


-199B-11 


Sequence 


11, Appl 


1304 


70 


.5 


2. 


5 


1181 


4 


us- 


•09- 


-000- 


-004A-4 


Sequence 


4, Appli 


1305 


70 


.5 


2. 


5 


1181 


4 


us- 


-09- 


-949- 


•016-6189 


Sequence 


618 9, Ap 


1306 


70 


.5 


2. 


5 


1195 


4 


us- 


-09- 


-538- 


-092-517 


Sequence 


517, App 


1307 


70 


.5 


2. 


5 


1195 


4 


us- 


•09- 


-949- 


•016-10747 


Sequence 


10747, A 


1308 


70 


.5 


2. 


5 


1429 


3 


us- 


■09- 


-245- 


-041-130 


Sequence 


130, App 


1309 


70 


.5 


2. 


5 


1429 


4 


us- 


•09- 


-358- 


-055B-131 


Sequence 


131, App 


1310 


70 


.5 


2. 


5 


1433 


4 


us- 


•09- 


-543- 


-681A-7342 


Sequence 


7342, Ap 


1311 


70 


.5 


2. 


5 


1500 


3 


us- 


•09- 


-323t 


:.47.2A-4 


Sequence 


4, Appli 


1312 


70 


.5 


2. 


5 


1500 


4 


us- 


•09- 


-585- 


-077C-4 


Sequence 


4, Appli 


1313 


70 


.5 


2. 


5 


1500 


4 


us- 


•09- 


-538- 


-092-1051 


Sequence 


1051, Ap 


1314 


70 


.5 


2. 


5 


1909 


4 


us- 


-09- 


-590- 


-968B-2 


Sequence 


2, Appli 


1315 


70 


.5 


2. 


,5 


2257 


4 


us- 


-09- 


-839- 


-477-8 


Sequence 


8, Appli 


1316 




70 


2. 


,5 


218 


4 


us- 


-09- 


-543- 


-681A-7848 


Sequence 


7848, Ap 


1317 




70 


2. 


,5 


224 


4 


us- 


-09- 


-902- 


-540-11622 


Sequence 


11622, A 


1318 




70 


2. 


,5 


230 


4 


us- 


-09- 


-090- 


-672B-33 


Sequence 


33, Appl 


1319 




70 


2. 


.5 


258 


4 


us- 


•09- 


-248- 


-796A-17206 


Sequence 


17206, A 


1320 




70 


2, 


,5 


262 


4 


us- 


-09- 


-248- 


-796A-15819 


Sequence 


15819, A 


1321 




70 


2. 


,5 


267 


4 


us- 


-09- 


-134- 


-000C-6273 


Sequence 


6273, Ap 


1322 




70 


2. 


.5 


267 


4 


us- 


-09- 


-902- 


-540-14651 


Sequence 


14651, A 



1323 


70 


2. 


5 


270 


4 


US-09-328-352-6533 


Sequence 


6533, Ap 


1324 


70 


2. 


5 


. 283 


4 


US-09-328-352-5031 


Sequence 


5031, Ap 


1325 


70 


2. 


5 


321 


4 


US-09-489-039A-9590 


Sequence 


9590, Ap 


1326 


70 


2. 


5 


332 


4 


US-09-489-039A-12580 


Sequence 


12580, A 


1327 


70 


2. 


5 


340 


4 


US-09-538-092-1103 


Sequence 


1103, Ap 


1328 


70 


2. 


5 


343 


4 


US-09-252-991A-16742 


Sequence 


16742, A 


1329 


70 


2. 


5 


349 


4 


US-09-252-991A-28248 


Sequence 


28248, A 


1330 


70 


2. 


5 


369 


4 


US-09-949-016-7624 


Sequence 


7624, Ap 


1331 


70 


2. 


5 


376 


4 


US-09-270-7 67-35078 


Sequence 


35078, A 


1332 


70 


2. 


5 


376 


4 


US-09-270-7 67-502 95 


Sequence 


50295, A 


1333 


70 


2. 


5 


385 


3 


US-09-134-001C-5166 


Sequence 


5166, Ap 


1334 


70 


2. 


5 


427 


4 


US-09-826-509-495 


Sequence 


495, App 


1335 


70 


2. 


5 


458 


4 


US-09-673-395A-226 


Sequence 


226, App 


1336 


70 


2. 


5 


466 


1 


US-08-334-698-6 


Sequence 


6, Appli 


1337 


70 


2. 


5 


466 


1 


US-08-228-932-6 


Sequence 


6, Appli 

f tr tr 


1338 


70 


2 . 


5 


466 


1 


US-08-468-939-6 


Sequence 


6, Appli 


1339 


70 


2 . 


5 


466 


1 


US-08-722-001-28 


Sequence 


2 8 , Appl 


1340 


70 


2 . 


5 


466 


2 


US-08-406-855A-6 


Sequence 


6, Appli 


1341 


70 


2 . 


5 


466 


2 


US-08-722-190-6 


Sequence 


6, Appli 


1342 


70 


2 . 


5 


466 


3 


US-08-244-354-6 


Sequence 


6, Appli 

t * *ir tr 


1343 


70 


2 . 


5 


466 


3 


US-09-206-899-6 


Sequence 


6, Appli 


1344 


70 


2 . 


5 


466 


4 


US-09-444-783-6 


Sequence 


6, Appli 


1345 


70 


2 . 


5 


466 


4 


US-09-688-415-6 


Sequence 


6, Appli 


1346 


70 


2 . 


5 


466 


4 


US-09-444-783-6 


Sequence 


6* Appli 

w r * tr tr 


1347 


70 


2. 


5 


466 


5 


PGT-US95- 04203-6 


Sequence 


6, Appli 


1348 


70 


2 . 


5 


472 


4 


Us- 09-54 0-236-37 60 


Sequence 


3760, Ap 


1349 


70 


2 . 


5 


484 


4 


Us- 09-32 8-352- 6041 


Sequence 


6041, Ap 


1350 


70 


2. 


5 


562 


4 


US-09-489-039A-8049 


Sequence 


8049, Ap 


1351 


70 


2 . 


5 


565 


4 


US-09-252-991A-17458 


Sequence 


17458, A 


1352 


70 


2 . 


5 


574 


4 


US-10-023-515-4 


Sequence 


4, Appli 


1353 


70 


2 . 


5 


575 


4 


US-09-543-681A-6584 


Sequence 


6584, Ap 


1354 


70 


2 . 


5 


585 


4 


US-09-248-796A-17511 


Sequence 


17511, A 


1355 


70 


2 . 


5 


606 


4 


US-09-4 8 9-039A-7827 


Sequence 


7827, Ap 


1356 


70 


2 . 


5 


607 


2 


US-08-752-307B-12 


Sequence 


12, Appl 


1357 


70 


2 . 


5 


607 


3 


Us- 09-7 07- 8 02- 12 


Sequence 


12, Appl 


1358 


70 


2. 


5 


607 


3 


US-09-991-326-12 


Sequence 


12, Appl 

' tr tr 


1359 


70 


2 . 


5 


612 


4 


US-09-248-796A-18040 


Sequence 


18040, A 


1360 


70 


2 . 


5 


636 


4 


US-09-270-7 67-4524 5 


Sequence 


45245, A 


1361 


70 


2 . 


5 


682 


4 


US-09-53 8-092-223 


Sequence 


223, App 

f It tr 


1362 


70 


2 . 


5 


704 


4 


US-09-107-532A-5612 


Sequence 


5612, Ap 


1363 


70 


2 . 


5 


723 


4 


US-09-252-991A-18279 


Sequence 


18279, A 


1364 


70 


2 . 


5 


767 


4 


US-09-328-352-4613 


Sequence 


4613, Ap 


1365 


70 


2 . 


5 


817 


4 


US-09-543-681A-4637 


Sequence 


4637, Ap 


1366 


70 


2. 


5 


818 


4 


US-09-949-016-8579 


Sequence 


8579, Ap 


1367 


70 


2 . 


5 


820 


4 


US-09-949-016-6134 


Sequence 


6134, Ap 


1368 


70 


2 . 


5 


846 


1 


US-08-357-598-12_. 


Sequence 


12, Appl 


1369 


70 


2 . 


5 


846 


2 


US-09-003-289-12 


Sequence 


12, Appl 


1370 


70 


2. 


5 


846 


5 


PCT-US95-16435-12 


Sequence 


12, Appl 


1371 


70 


2. 


5 


868 


4 


US-09-538-092-787 


Sequence 


787, App 


1372 


70 


2. 


5 


952 


4 


US-09-248-796A-20878 


Sequence 


20878, A 


1373 


70 


2 . 


5 


987 


1 


US-08-436-044-6 


Sequence 


6, Appli 

* tr tr 


1374 


70 


2. 


5 


987 


2 


US-08-436-054-6 


Sequence 


6, Appli 


1375 


70 


2. 


5 


987 


5 


PCT-US95-08812-6 


Sequence 


6, Appli 


1376 


70 


2. 


5 


989 


3 


US-09-110-517-4 


Sequence 


4, Appli 


1377 


70 


2. 


5 


989 


4 


US-09-710-279-2594 


Sequence 


2594, Ap 


1378 


70 


2. 


5 


990 


4 


US-09-949-016-7235 


Sequence 


7235, Ap 


1379 


70 


2. 


5 


998 


4 


US-09-949-016-8326 


Sequence 


8326, Ap 



1380 


70 


2. 


5 


1145 


4 


US-09-438-185A-191 


Sequence 


191, App 


1381 


70 


2. 


5 


1257 


2 


US-08-750-152A-2. 


Sequence 


2, Appli 


1382 


70 


2. 


5 


1276 


1 


US-08-222-616-24 


Sequence 


24, Appl 


1383 


70 


2. 


5 


1276 


3 


US-08-446-648-24 


Sequence 


24, Appl 


1384 


70 


2. 


5 


1276 


4 


US-09-982-610-24 


Sequence 


24, Appl 


1385 


70 


2. 


5 


1276 


5 


PCT-US95-04228-24 


Sequence 


24, Appl 


1386 


70 


2. 


5 


1967 


4 


US-09-849-602-16 


Sequence 


16, Appl 


1387 


70 


2. 


5 


3174 


2 


US-08-477-451-3 


Sequence 


3, Appli 


1388 


69.5 


2. 


5 


145 


4 


US-09-107-532A-6302 


Sequence 


6302, Ap 


1389 


69.5 


2. 


5 


215 


4 


US-09-711-164-361 


Sequence 


361, App 


1390 


69.5 


2. 


5 


220 


1 


US-07-820-154A-4 


Sequence 


4, Appli 


1391 


69.5 


2. 


5 


220 


2 


US-08-097-554A-4 


Sequence 


4 , Appli 


1392 


69.5 


2. 


5 


220 


3 


US-08-480-640A-4 


Sequence 


4, Appli 


1393 


69.5 


2 . 


5 


220 


3 


US-08-295-802-4 


Sequence 


4, Appli 


1394 


69.5 


2. 


5 


220 


3 


US-08-488-237A-4 


Sequence 


4, Appli 


1395 


69.5 


2. 


5 


220 


3 


US-08-375-992A-4 


Sequence 


4, Appli 


1396 


69.5 


2. 


5 


220 


4 


US-08-472-679H-4 


Sequence 


4, Appli 


1397 


69.5 


2. 


5 


220 


5 


PCT-US93-00324-4 


Sequence 


4, Appli 


1398 


69.5 


2. 


5 


230 


2 


US-08-537-400-16 


Sequence 


16, Appl 


1399 


69.5 


2. 


5 


230 


2 


US-08-706-702-18 


Sequence 


18, Appl 


1400 


69.5 


2. 


5 


230 


3 


US-08-706-706-18 


Sequence 


18, Appl 


1401 


69.5 


2. 


5 


230 


4 


US-09-238-471-18 


Sequence 


18, Appl 


1402 


69.5 


2. 


5 


262 


4 


US-09-710-279-876 


Sequence 


876, App 


1403 


69.5 


2. 


5 


263 


3 


US-09-134-001C-5231 


Sequence 


5231, Ap 


1404 


69.5 


2. 


5 


303 


4 


US-09-543-681A-7924 


Sequence 


7924, Ap 


1405 


69.5 


2. 


5 


320 


3 


US-08-793-666-10 


Sequence 


10, Appl 


1406 


69.5 


2. 


5 


331 


4 


US-09-107-433-3138 


Sequence 


3138, Ap 


1407 


69.5 


2. 


5 


337 


4 


US-09-198-452A-630 


Sequence 


630, App 


1408 


69.5 


2. 


5 


338 


4 


US-09-328-352-4723 


Sequence 


4723, Ap 


1409 


69.5 


2. 


5 


338 


4 


US-09-248-796A-19784 


Sequence 


19784, A 


1410 


69.5 


2. 


5 


342 


4 


US-09-248-796A-19527 


Sequence 


19527, A 


1411 


69. 5 


2. 


5 


344 


1 


US-08-403-866-9 


Sequence 


9, Appli 


1412 


69.5 


2. 


5 


345 


3 


US-09-173-581-6 


Sequence 


6, Appli 


1413 


69.5 


2. 


5 


345 


3 


US-09-420-915-6 


Sequence 


6, Appli 


1414 


69.5 


2. 


5 


355 


4 


US-09-175-684A-9 


Sequence 


9, Appli 


1415 


69.5 


2. 


5 


361 


4 


US-09-175-684A-10 


Sequence 


10, Appl 


1416 


69.5 


2. 


5 


370 


3 


US-09-134-001C-3769 


Sequence 


3769, Ap 


1417 


69.5 


2. 


5 


372 


4 


US-09-248-796A-15166 


Sequence 


15166, A 


1418 


69.5 


2. 


5 


379 


4 


US-09-248-796A-15312 


Sequence 


15312, A 


1419 


69.5 


2. 


5 


382 


3 


US-09-134-001C-3447 


Sequence 


3447, Ap 


1420 


69.5 


2. 


5 


397 


4 


US-09-438-185A-588 


Sequence 


588, App 


1421 


69.5 


2. 


5 


398 


6 


5187089-5 


Patent No. 


. 5187089 


1422 


69.5 


2. 


5 


398 


6 


5187089-5 


Patent No. 


. 5187089 


1423 


69.5 


2. 


5 


454 


4 


US-09-248-796A-20204 


Sequence 


20204, A 


1424 


69.5' 


2. 


5 


476 


4 


US-09-198-452A-1021 


Sequence 


1021, Ap 


1425 


69.5 


2. 


5 


477 


4 


US-09-583-110-3555 


Sequence 


3555, Ap 


1426 


69.5 


2. 


5 


478 


4 


US-09-248-796A-20836 


Sequence 


20836, A 


1427 


69.5 


2. 


5 


479 


4 


US-09-438-185A-950 


Sequence 


950, App 


1428 


69.5 


2. 


5 


502 


4 


US-09-134-000C-4863 


Sequence 


4863, Ap 


1429 


69.5 


2. 


5 


502 


4 


US-09-248-796A-14780 


Sequence 


14780, A 


1430 


69.5 


2. 


5 


503 


4 


US-09-248-796A-18992 


Sequence 


18992, A 


1431 


69.5 


2. 


5 


505 


3 


US-08-627-907A-2 


Sequence 


2, Appli 


1432 


69.5 


2. 


,5 


506 


4 


US-09-902-540-15114 


Sequence 


15114, A 


1433 


69.5 


2. 


,5 


528 


4 


US-09-543-681A-4418 


Sequence 


4418, Ap 


1434 


69.5 


2. 


,5 


550 


4 


US-09-489-039A-13969 


Sequence 


13969, A 


1435 


69.5 


2. 


.5 


560 


4 


US-09-821-016-3 


Sequence 


3, Appli 


1436 


69.5 


2. 


.5 


560 


4 


US-10-266-787-3 


Sequence 


3, Appli 



1437 


69 


.5 


2. 


5 


613 


4 


us- 


09- 


117- 


415B-22 


Sequence 


22 , Appl 


1438 


69 


.'5 


2. 


5 


631 


4 


us- 


09- 


117- 


415B-18 


Sequence 


18, Appl 


1439 


69 


.5 


2. 


5 


631 


4 


us- 


09- 


117- 


415B-20 


Sequence 


20, Appl 


1440 


69 


.5 


2. 


5 


633 


4 


us- 


09- 


919- 


060-13 


Sequence 


13, Appl 


1441 


69 


.5 


2. 


5 


649 


4 


us- 


09- 


117- 


415B-16 


Sequence 


16, Appl 


1442 


69 


.5 


2. 


5 


677 


3 


us- 


08- 


480- 


640A-115 


Sequence 


115, App 


1443 


69 


.5 


2. 


5 


677 


3 


us- 


08- 


480- 


640A-193 


Sequence 


193, App 


1444 


69 


.5 


2. 


5 


677 


3 


us- 


08- 


295- 


802-115 


Sequence 


115, App 


1445 


69 


.5 


2. 


5 


677 


3 


us- 


08- 


686- 


968C-58 


Sequence 


58, Appl 


1446 


69 


.5 


2. 


5 


677 


3 


us- 


08- 


686- 


968C-193 


Sequence 


193, App 


1447 


69 


.5 


2. 


5 


677 


3 


us- 


08- 


488- 


237A-115 


Sequence 


115, App 


1448 


69 


.5 


2. 


5 


677 


3 


us- 


08- 


488- 


237A-193 


Sequence 


193, App 


1449 


69 


.5 


2. 


5 


677 


3 


us- 


08- 


375- 


992A-115 


Sequence 


115, App 


1450 


69 


. 5 


2. 


5 


677 


3 


us- 


08- 


375- 


992A-193 


Sequence 


193, App 


1451 


69 


.5 


2. 


5 


677 


4 


us- 


08- 


■472- 


679H-115 


Sequence 


115, App 


1452 


69 


.5 


2. 


5 


677 


4 


us- 


08- 


472- 


679H-193 


Sequence 


193, App 


1453 


69 


.5 


2 . 


,5 


721 


4 


us- 


08- 


•851- 


435-2 


Sequence 


2, Appli 


1454 


69 


.5 


2. 


,5 


74 6 


4 


us- 


•08- 


•851- 


•435-6 


Sequence 


6, Appli 


1455 


69 


.5 


2. 


,5 


789 


4 


us- 


09- 


•248- 


•796A-19294 


Sequence 


19294, A 


1456 


69 


.5 


2. 


.5 


800 


4 


us- 


09- 


■270- 


•767-45282 


Sequence 


45282, A 


1457 


69 


. 5 


2 . 


.5 


816 


4 


us- 


09- 


■248- 


-796A-20939 


Sequence 


20939, A 


1458 


69 


. 5 


2, 


, 5 


839 


4 


us- 


•09- 


-758- 


■282B-230 


Sequence 


230, App 


1459 


69 


. 5 


2. 


,5 


839 


4 


us- 


■09- 


-577- 


-304A-230 


Sequence 


230, App 


1460 


69 


. 5 


2. 


.5 


864 


4 


us- 


•09- 


-489- 


-039A-12869 


Sequence 


12869, A 


1461 


69 


. 5 


2 , 


. 5 


864 


4 


us- 


09- 


-751- 


-687-18 


Sequence 


18, Appl 


1462 


69 


. 5 


2 . 


.5 


959 


4 


us- 


-09- 


-248- 


-796A-20776 


Sequence 


20776, A 


1463 


69 


.5 


2. 


.5 


1024 


4 


us- 


•09- 


-562- 


-737-42 


Sequence 


42, Appl 


1464 


69 


.5 


2. 


.5 


1050 


4 


us- 


•09- 


-902- 


-540-16666 


Sequence 


16666, A 


1465 


69 


. 5 


2 , 


. 5 


1085 


4 


us- 


-09- 


-198- 


-452A-961 


Sequence 


961, App 


1466 


69 


. 5 


2. 


.5 


1087 


4 


us- 


-09- 


-438- 


-185A-893 


Sequence 


893, App 


1467 


69 


. 5 


2, 


.5 


1091 


4 


us- 


-10- 


-162- 


-012-15 


Sequence 


15, Appl 


1468 


69 


.5 


2 


.5 


1211 


4 


us- 


-09- 


-328- 


-352-7967 


Sequence 


7967, Ap 


1469 


69 


.5 


2 


.5 


1260 


3 


us- 


-09- 


-245- 


-041-2 


Sequence 


2, Appli 


1470 


69 


. 5 


2 


.5 


1260 


4 


us- 


-09- 


-358- 


-055B-2 


Sequence 


2, Appli 


1471 


69 


.5 


2 


.5 


1260 


4 


us- 


-09- 


-893- 


-238-2 


Sequence 


2, Appli 


1472 


69 


.5 


2 


.5 


1724 


3 


us- 


-08- 


-857- 


-076-12 


Sequence 


12, Appl 


1473 


69 


. 5 


2 


.5 


4866 


4 


us- 


-09- 


-424- 


-783-2 


Sequence 


2, Appli 


1474 


69 


.5 


2 


.5 


6095 


3 


us- 


-09- 


-144- 


-085-2 


Sequence 


2, Appli 


1475 




69 


2 


.5 


203 


4 


us- 


-08- 


-956- 


-171E-5207 


Sequence 


5207, Ap 


1476 




69 


2 


.5 


203 


4 


us~ 


>08- 


-781- 


-986A-5207 


Sequence 


5207, Ap 


1477 




69 


2 


.5 


285 


4 


us- 


-09- 


-328- 


-352-6464 


Sequence 


6464, Ap 


1478 




69 


2 


.5 


314 


4 


us- 


-09- 


-583- 


-110-3992 


Sequence 


3992, Ap 


1479 




69 


2 


.5 


315 


4 


us- 


-09- 


-107- 


-433-4625 


Sequence 


4625, Ap 


1480 




69 


2 


.5 


342 


4 


us- 


-09- 


-248- 


-796A-15902 


Sequence 


15902, A 


1481 




69 


2 


.5 


358 


4 


us- 


-09- 


-198- 


-452A-188 


Sequence 


188, App 


1482 




69 




.5 


358 


4 


us- 


-09- 


-438- 


-185A-172 


Sequence. 


.172, App 


1483 




69 


2 


.5 


369 


4 


us- 


-09- 


-710- 


-279-1390. 


Sequence 


1390, Ap 


1484 




69 


2 


.5 


369 


4 


us- 


-09- 


-710- 


-279-1610 


Sequence 


1610, Ap 


1485 




69 


2 


.5 


372 


4 


us- 


-09- 


-270- 


-767-46648 


Sequence 


46648, A 


1486 




69 


2 


.5 


373 


4 


us- 


-09- 


-248- 


-796A-14832 


Sequence 


14832, A 


1487 




69 


2 


.5 


385 


4 


us- 


-09- 


-538- 


-092-333 


Sequence 


333, App 


1488 




69 


2 


.5 


389 


3 


us- 


-09- 


-307- 


-621-2 


Sequence 


2, Appli 


1489 




69 


2 


.5 


393 


4 


us- 


-09- 


-248- 


-7'96A-19608 


Sequence 


19608, A 


1490 




69 


2 


.5 


397 


6 


5187089-2 




Patent No 


. 5187089 


1491 




69 


2 


.5 


397 


6 


5187089-2 




Patent No. 5187089 


1492 




69 


2 


.5 


407 


4 


us- 


-09 


-270 


-767-43872 


Sequence 


43872, A 


1493 




69 


2 


.5 


419 


2 


us- 


-08 


-933 


-750C-30 


Sequence 


30, Appl 



1494 


69 


2.5 


419 


3 


US-09-234- 


•613-30 


Sequence 


30, Appl 


1495 


69 • 


2.5 


452 


4 


US-09-248- 


•796A-16171 


Sequence 


16171, A 


1496 


69 


2.5 


455 


3 


US-09-038- 


-217A-18 


Sequence 


18, Appl 


1497 


69 


2.5 


455 


4 


US-09-447- 


•034-18 


Sequence 


18, Appl 


1498 


69 


2.5 


467 


4 


US-09-198- 


-452A-1218 


Sequence 


1218, Ap 


1499 


69 


2.5 


475 


3 


US-09-212- 


-247C-4 


Sequence 


4, Appli 


1500 


69 


2.5 


484 


6 


5171673-8 




Patent No. 


5171673 



ALIGNMENTS 



RESULT 1 

US-09-949-016-5947 

Sequence 5947, Application US/09949016 
Patent No. 6812339 
GENERAL INFORMATION: 
APPLICANT: VENTER, J. Craig et al . 

TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 
TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 
THEREOF 

FILE REFERENCE: CL001307 

CURRENT APPLICATION NUMBER: US/09/949, 016 
CURRENT FILING DATE: 2000-04-14 
PRIOR APPLICATION NUMBER: 60/241,755 
PRIOR FILING DATE: 2000-10-20 

PRIOR APPLICATION NUMBER: 60/237,768 i 
PRIOR FILING DATE: 2000-10-03 
PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207012 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 5947 
LENGTH: 533 
TYPE: PRT 
ORGANISM: Human 
US-09-949-016-5947 

Query Match 24.5%; Score 679; DB 4; Length 533; 

Best Local Similarity 33.8%; Pred. No. 2.4e-63; 

Matches 175; Conservative 93; Mismatches 211; Indels 38; Gaps 13; 

Qy 8 LLVGFLL— PGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

I : : I I I I : : I I I I I I I I I I : I I I I I I I :: I I 
Db 11 LVLGLLLCVLGPWSHAGKILLI-PVDGSHWLSMLGAIQQLQQRGHEIWL AP 62 

Qy 66 D FKKEEKS YQVI SWLAPEDHQRE- FKKS F DFFLEETLGGRGKFENLL 111 

I : : : I : : : I Nihil III:: : : : 

Db 63 DASLYI RDGAFYTLKTY — PVPFQREDVKES FVSLGHNVFENDS FLQRVI KTYKKIK 117 

Qy 112 NVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPBVAILSTSFG 171 

I I I I I : I : : I I I : I I : : : : I I : : I : I II I 
Db 118 KDSAMLLSGCSHLLHNKELMASLAESSFDVMLTDPFLPCSPIVAQYLSLPTVFFLHALPC 177 

Qy 172 SLEF GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEH 228 

II I I I I I I I I I : I I I I llllhll : I : I : 

Db 178 SLEFEATQCPNPFSYVPRPLSSHSDHMTFLQRVKNMLIAFSQNFLCDWYSPY-ATLASE 236 



Qy 229 FTEGSRPVLSHLLLKAELWFINSDFAFDF7VRPLLPNTVYVGGLMEKPIKPVPQDLENFIA 288 . 

I : : II I : I III I : I I : : I I I : I I I : | : [ : I : I 

Db 237 FLQ-REWVQDLLSSASWLFRSDFWDYPRPIMPNMVFVGGINCLHQNPLSQEFEAYIN 295 

Qy 289 KFGDSGFXTLVTLGSMWTCQNPEIFKEMNN^^ 34 8 

I : I I : : I I I I I : : : : I : I I I : I : : : : I I I : 
Db 296 ASGEHGIWFSLGSMVSEI PEKKAM-AIADALGKI PQTVLWRYTGT RPSNLANNTIL 351 

Qy 349 VDWLPQSDLI^PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGV 408 

11111:11111 I I : I I I : : I : I : I I I I I : I I I I I I : I hi I II 
Db 352 VKWLPQNDLLGHPMTRAPITHAGSHGWESICNGVP 411 



Qy 409 S IQLKKLKAETLALKMKQIMEDKRYKSAAVAASVI LRSHPLSPTQRLVGWI DHVLQTGGA 468 

:::::: I I : I : : I I I I : I : : I : I I I : : I : : II 

Db. 412 TL^^^EMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGA 471 

Qy 469 THLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

I I : I I : : : I I I I I : I : : I 

Db 472 PHLRPAAHDLTWYQ YHS LDVI GFLLAWLTVAFI T FK 508 



RESULT 2 

US-09-949-016-7684 

; Sequence 7684, Application US/09949016 
; Patent No. 6812339 
; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al . 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 

THEREOF 

; FILE REFERENCE: CLO 01307 

; CURRENT APPLICATION NUMBER: US/09/94 9 , 016 

; CURRENT FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS : 207012 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 7684 

LENGTH: 538 

TYPE: PRT 
; ORGANISM: Human 
US-09-949-016-7684 

Query Match 24.5%; Score 679; DB 4; Length 538; i 

Best Local Similarity 33.8%; Pred. No. 2.4e-63; 

Matches 175; Conservative 93; Mismatches 211; Indels 38; Gaps 13; 

Qy 8 LLVGFLL — PGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

I : : I I I I : : I I I I I I I I I I : I I I I I I I : : I I 
Db 16 LVLGLLLCVLGPWSHAGKILLI-PVDGSHWLSMLGAIQQLQQRGHEIWL AP 67 



Qy 



66 D FKKEEKS YQVI SWLAPEDHQRE- FKKS F- 



DFFLEETLGGRGKFENLL 111 



I : : : I : : : I Nihil III:: : : : 

Db . 68 DASLYI RDGAFYTLKT Y — PVPFQREDVKES FVSLGHNVFENDS FLQRVI KTYKKIK. 122 



Qy 112 NVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG 171 

I II I I : I : : I I I : I I : : : : I I : : I : I II I 
Db 123 KDSAMLLSGCSHLLHNKELMASLAESSFDVMLTDPFLPCSPIVAQYLSLPTVFFLHALPC 182 

Qy 172 SLEF GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEH 228 

I I I I I I I I I I I Mill 11111:11 : I : I : 

Db 183 SLEFE^TQCPNPFSWPRPLSSHSDHMTFLQRVKNMLIAFSQNFLCDVVYSPY-ATLASE 241 

Qy 229 FTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIA 288 

I : : II I : I III I : I I : : I I I : I I I : | : I : I : I 

Db 242 FLQ-REVTVQDLLSSASVWLFRSDFVKDYPRPIMPNMVFVGGINCLHQNPLSQEFEAYIN 300 

Qy 289 KFGDSGFVLWLGSMVNTCQNPEIFKEMN^ 348 

I : I I : : I I I I I : : : : I : I I I : I : : : : I.I I : 
Db 301 ASGEHGI WFSLGSMVSEI PEKKAM-AIADALGKI PQTVLWRYTGT RPSNLANNTIL 356 

Qy 349 VDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPEN1VIVRVEAKKFGV 408 

I I I I I : I I I I I I I : I I I : : I : I : I I I I I : I I I I II : I hi I M 
Db 357 VKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGV 416 

Qy 4 09 SIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGA 468 

:::::: I I : I : : I I I I : I : : I : I I I : : I : : II 

Db 417 TLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFV^^ 476 



Qy 469 THLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

I I : I I : : : II I I I : I : : I 

Db 477 PHLRPAAHDLTWYQYHSLDVIGFLLAWLTVAFITFK 513 



RESULT 3 

PCT-US92-00282-3 

; Sequence 3, Application PC/TUS9200282 
; GENERAL INFORMATION: 

APPLICANT: OWENS, IDA S. 
; , APPLICANT: RITTER, JOSEPH K. 

" TITLE OF INVENTION: THE GENETIC LOCUS UGT1 AND A MUTATION 
; TITLE OF INVENTION: THEREIN. 

; NUMBER OF SEQUENCES: 40 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CUSHMAN DARBY & CUSHMAN 

STREET: 1615 L STREET, N.W. 
; CITY: WASHINGTON 

STATE: D.C. 

COUNTRY: U.S.A. 

ZIP: 20036-5601 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/ 00282 
; FILING DATE: 19920110 

CLASSIFICATION: 435 



ATTORNEY/AGENT INFORMATION: 
; NAME: SCOTT, WATSON T. 

; REGISTRATION NUMBER: 26581 

; REFERENCE/ DOCKET NUMBER: 91532-PCT 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-861-3000 
; TELEFAX: 202-822-0944 

TELEX: 6714627 CUSH 
; INFORMATION FOR SEQ ID NO: 3: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 533 amino acids 

TYPE: AMINO ACID 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
PCT-US92-00282-3 



Query Match 24.3%; Score 674; DB 5; Length 533; 

Best Local Similarity 33.8%; Pred. No. 8.2e-63; 

Matches 175; Conservative 92; Mismatches 212; Indels 38; Gaps 13; 

Qy 8 LLVGFLL — PGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

I : : I I I I :: I I I I I I I I I I : I I I I I I I :: I l' 
Db 11 LVLGLLLCVLGPWSHAGKILLI-PVDGSHWLSMLGAIQQLQQRGHEIWL AP 62 

Qy 66 D FKKEEKS YQVI SWLAPEDHQRE-FKKSF DFFLEETLGGRGKFENLL 111 

I : : : I : : : I Nihil III:: : : : 

Db 63 DASLYIRDGAFYTLKTY — PVP FQREDVKES FVS LGHNVFENDS FLQRVI KTYKKIK 117 



Qy 112 NVXEYLALQCSHFLNRKDIMDSLKNENFDMVI V^TFDYCPFLIAEKLGKPFVAILSTSFG 171 

I I I I I : I : : I I I : I I : : : : I I : : I : I II I 
Db 118 KDSAMLLSGCSHLLHNKELMASLAESSFDVMLTDPFLPCSPIVAQYLSLPTVFFLHALPC 177 

Qy 172 SLEF GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEH 228 

I I I I I I I II I I : I I I I Mllhll : I : I : 

Db 178 SLEFEATQCPNPFSWPRPLSSHSDHMTFLQRVl^NMLIAFSQNFLCDvVYSPY-ATLASE 236 

Qy 229 FTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIA 288 

I : : I I I : I III I : I I : : I I I : I I I : I : I : I : I 

Db 237 FLQ-REVTVQDLLSSASVWLFRSDFVKDYPRPIMPNMVFVGGINCLHQNPLSQEFEAYIN 295 

Qy 289 KFGDSGFVXWLGSMVTCTCQNPEIFKEMNNAiTAHLPQGV^ 348 

I : I I,: : I I I I I : : : : I I I I : I : : : : I I I : 
Db 296 ASGEHGIWFSLGSMVSEIPEKKAM-AIADALGKNPQTVLWRYTGT RPSNLANNTIL 351 



Qy , 349 VT5WLPQSDLLAHPSIRLEVTHGGQNSIMEAIQHGVPMTO 408 

11111:11111 I I : I I I : : I : I : I I I I I : I I I I I I : I hi I II 
Db 352 VKW L P QN D L LGH PMT RAFI T HAG S H G VY E S I CN GVPMVMMP L FG DQMDNAKRMET KGAGV 411 

Qy 409 SIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGA 468 

:::::: I I : I : : I I I I : I : : I : I I I : : I : : II 

Db 412 TLNVTiEMTSEDLEN/^KAVINDKSYKENIMRLSSLHKDRPVTlPLD^VFV^FVMRHKGA 471 



Qy 469 THLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

I I : I I : : : I I I I I : I : : I 

Db 472 PHLRPAAHDLTWYQ YHS LDVI GFLLAWLTVAFI T FK 508 



RESULT 4 

US-09-949-016-8465 

; Sequence 8465, Application US/09949016 
; Patent No. 6812339 
; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al . 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 

THEREOF 

; FILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/949,016 

; CURRENT FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS : 207 012 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 8465 

LENGTH: 540 

TYPE: PRT 

ORGANISM: Human 
US-09-949-016-8465 

Query Match 24.0%; Score 663.5; DB 4; Length 540; 

Best Local Similarity 34.2%; Pred. No. l.le-61; 

Matches 173; Conservative 88; Mismatches 214; Indels 31; Gaps 15; 

Qy 34 SHYLLMDRVSQI LQDHGHNVTMLNHKRG PFMPDFKKEEKSYQVISWLAPEDHQREF 89 

II:::: I I I I I : I II II : : I I : : 

Db 46 SHWMNIKTILDELVQRGHEVTVLASSASISFDPNSPSTLKFEVYPVSLTKTEFEDIIKQL 105 

Qy 90 KKS FDFFLEETLGGRGKFENLLNVL EYLALQCSHFLNRKDIMDSLKNENFDMVIVE 145 

I : : : I I : : : : I I : : I : I I :. II : I : : 

Db 106 VKRWAELPKDTFW — SYFSQVQEIMWTFNDILRKFCKDIVSNKKLMKKLQESRFDWLAD 163 

Qy 146 TFDYCPF — LIAEKLGKPFVAILSTSFG-SLE FGLPI PLS YVPVFRSLLTDHMDFWG 199 

II I : I I I III I I I : : I II I I I I I I I hi I I 

Db 164 A— VFPFGELLAELLKIPFVYSLRFSPGYAIEKHSGGLLFPPSYVPWMSELSDQMTFIE 221 

Qy 200 RVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDF 257 

I I I I : : : : I I : I : : I I I I I : II : : I I : : I I 

Db 222 RVKNMI.YVLYFEFWFQIFDMKK-WDQFYSE — VLGRPTTLSETMAKADIWLIRNYWDFQF 278 

Qy 258 ARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV-NTCQNPEIFKEM 316 

I I I I I : I I I I I I I I : I : : : I I : I : : I I : : I I I I I I I : I ' : 
Db 279 PHPLLPNVEFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLGSMVSNTSE — ERANVI 336 

Qy 317 NNAFAHLPQGVIWKCQCSHWPKDVllLAANWIVl)WLPQSDLLAHPSIRLFvTHGGQNSIM 37 6 

: I I : I I I : I : : I I I :: I : I I : I I I I I I I : I I I I I I 
Db 337 ASALAKI PQKVLWRFDGN KPDTLGLNTRLYKWIPQNDLLGHPKTRAFITHGGANGIY 393 

Qy 377 EAIQHGVPMVGIPLFGDQPENMWvT^KFGVSIQLKKLKAETLALKMKQIMEDKRYKSA 436 

I I I I I : I I I I : I I I I I I : I : : : I I I I : : : I : I : : I II 



Db 



394 EAIYHGIPMVGVPLFADQPDNIAHMKAKGAAVSLDFHTMSSTDLLNALKTVINDPLYKEN 45,3 



Qy 437 AVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLT 496 

I : I I I : I : I I I I : I : : I I I I : I : : I I I I I 

Db 454 AMKLSRIHHDQPVKPLERAVFWIEFVMRHKGAKHLRVAAHDLTWFQYHSLDVTGFLLACV 513 

Qy 497 LGTLWLCGKLLGMAVW-WLRGARKVK 521 

: : : I I I I : : I : I I 
Db 514 ATVI FI ITKCL- FCVWKFVRTGKKGK 538 



RESULT 5 

US-09-949-016-9052 

; Sequence 9052, Application US/09949016 
; Patent No. 6812339 
; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al . 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 

THEREOF 

; FILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/949,016 

; CURRENT FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS : 207012 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 9052 

LENGTH: 54 0 
; TYPE: PRT 

ORGANISM: Human 
US-09-949-016-9052 

Query Match 24.0%; Score 663.5; DB 4; Length 540; 

Best Local Similarity 34.2%; Pred. No. l.le-61; 

Matches 173; Conservative 88; Mismatches 214; Indels 31; Gaps 15; 

Qy 34 SHYLLMDRVSQI LQDHGHNVTMLNHKRG PFMPDFKKEEKSYQVI SWLAPEDHQREF 89 

II:::: I I I I I : I II II : : I I : : 

Db 46 SHWMNI KTI LDELVQRGHEVTVLASSAS I S FDPNS PSTLKFEVYPVSLTKTEFEDI IKQL 105 

Qy 90 KKS FDFFLEETLGGRGKFENLLNVL EYLALQCSHFLNRKDIMDSLKNENFDMVIVE 145...,,.. 

I : ::| I : :: : I I : : I : I I : ||:|: : 

Db 106 VKRWAELPKDTFW--SYFSQVQEIMWTFNDILRKFCKDIVSNKKLMKKLQESRFDVVLAD 163 

) 

Qy 146 TFDYCPF — LIAEKLGKPFVAILSTSFG-SLE FGLPI PLSYVPVFRSLLTDHMDFWG 199 

II I : I I I III I I I : : I II llllll I I : I I I 

Db 164 A— VFPFGELLAELLKIPFVYSLRFSPGYAIEKHSGGLLFPPSYVPWMSELSDQMTFIE 221 



Qy 

Db 



200 RVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDF 257 

I I I I ::::! I : |: :| I I II : | | : : | I : : I I 

222 RVKNMIYVLYFEFWFQIFDMKK-WDQFYSE — VLGRPTTLSETMAKADIWLIRNYWDFQF 278 



Qy 258 ARPLLPNTVWGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV-NTCQNPEIFKEM 316 

Mill : I I I I I I I I : I : : : I I : I .: : I I : : II I I I I I : I : • 
Db 279 PHPLLPNVEFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLGSMVSNTSE--ERANVI 336 

Qy 317 NNAFAHLPQGVIWKCQCSHWPKDVHLAAW 376 

: | | : I I I : I : : I I I : : I : I I : I I I I I I I : I I I I I I 
Db 337 ASALAKI PQKVLWRFDGN KPDTLGLNTRLYKWIPQNDLLGHPKTRAFITHGGANGIY 393 

Qy 377 EAIQHGVPMVGIPLFGDQPEN1WRVEAKKFGVSIQLKKLKAETLALKMKQIM 436 

I I I I I : I I I I : I I I I I I : I : : : I I I I : ■ : : I : I : : I II 
Db 394 EAIYHGIPMVGVPLFADQPDNIAHMKAKGAAVSLDFHTMSSTDLLNALKTVINDPLYKEN 453 

Qy 437 AVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLT 496 

I : I I I : I : I I I I : I : : I I I I : I :: I I I I I 

Db 454 AMKLS RI HHDQPVKPLERAVFWI EFVMRHKGAKHLRVAAHDLTWFQYHSLDVTGFLLACV 513 

Qy 497 LGTLWLCGKLLGMAVW-WLRGARKVK 521 

: : : I I I I : : I : I I 
Db 514 ATVIFIITKCL-FCVWKFVRTGKKGK 538 



RESULT 6 
US-09-356-806-8 

; Sequence 8, Application US/09356806 

; Patent No. 6586175 

; GENERAL INFORMATION: 

; APPLICANT: Penny, Laura 

; APPLICANT: Galvin, Margaret 

; APPLICANT: Miller, Andrew 

; APPLICANT: Reidy, Michael 

; TITLE OF INVENTION: Genotyping Human 

; TITLE OF INVENTION: UDP-Glucuronosyltrans f erase 2B4 (UGT2B4), 2B7 (UGT2B7) 
and 

; TITLE OF INVENTION: 2B15 (UGT2B15) Genes 
; FILE REFERENCE: SEQ-22PRV2 

; CURRENT APPLICATION NUMBER: US/09/356,806 
; CURRENT FILING DATE: 1999-07-20 
; NUMBER OF SEQ ID NOS : 164 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 8 

LENGTH: 528 

TYPE: PRT 
; ORGANISM: H. sapiens 
US-09-356-806-8 

Query Match 23.9%; Score. 661. 5; DB 4; Length 528; 

Best Local Similarity 34.2%; Pred. No. 1.7e-61; 

Matches 173; Conservative 87; Mismatches 215; Indels 31; Gaps 15; 

Qy 34 SHYLLMDRVSQI LQDHGHNVTMLNHKRG PFMPDFKKEEKS YQVI SWLAPEDHQREF 89 

||:::: I I I I I : I II II : : I I : : 

Db 34 SHWMNIKTILDELVQRGHEVTVLASSASISFDPNSPSTLKFEVYPVSLTKTEFEDIIKQL 93 

Qy 90 KKSFDFFLEETLGGRGKFENLLNVL EYLALQCSHFLNRKDIMDSLKNENFDMVIVE 145 

I : ::| I : :: : I I :: | :| |: I |:|: : 

Db 94 VKRWAELPKDTFW- - S YFSQVQEIMWTFNDI LRKFCKDI VSNKKLMKKLQESRFDWLAD 151 



Qy 146 TFDYCPF — LIAEKLGKPFVAILSTSFG-SLE FGLPI PLS YVPVFRSLLTDHMDFWG 199 

II I : I I I III I I I : : I II I I I I I I I hi I I 

Db 152 A--VFPFGELLAELLKIPFVYSLRFSPGYAIEKHSGGLLFPPSYVPWMSELSDQMTFIE 209 

Qy 200 RVKN- - FLMFFS FCRRQQHMQSTFDNTI KEHFTEGS RPVLSHLLLKAELWFINSDFAFDF 257 

I I I I : : : : I I : I : : I I I I I : I I : : I I : : I I 
Db 210 RVKNMIYVLYFEFWFQIFDMKK-WDQFYSE — VLGRPTTLSETMAKADIWLIRNYWDFQF 266 



Qy 258 ARPLLPNTVYVGGLMEKPI KPVPQDLENFIAKFGDSGFVXVTLGSMV-NTCQNPEI FKEM 316 

I I I I I : I I I I | | | | : I : : : I I : I : : I I : : I I I I I I I : I : 
Db 267 PHPLLPNVEFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLGSMVSNTSE — ERANVI 324 

Qy 317 NNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLMPSIRLFVTHGGQNSIM 376 

: I I : I I I : I : : I I I : : I : I I : I I I I I I I : I I I I I I 
Db 325 ASALAKI PQKVLWRFDGN KPDTLGLNTRLYKWIPQNDLLGHPKTRAFITHGGANGIY 381 



Qy 377 EAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSA 436 

I I I I I : I I I I : I I I I I I : I : : : I I I I : : : I : I : : I II 
Db 382 EAIYHGIPMVGVPLFADQPDNIAHMKAKGAAVSLDFHTMSSTDLLNALKTVINDPLYKEN 441 

Qy 437 AVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLT 496 

I: II 1:1 I I II: I:: MM: I : : II III 

Db 442 AMKLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAAHDLTWFQYHSLDVTGFLLACV 501 

Qy 4 97 LGTLWLCGKLLGMAVW-WLRGARKVK 521 

: : : I I I I : : I : I I 
Db 502 ATVI FI ITKCL- FCVWKFVRTGKKGK 526 



RESULT 7 

PCT-US92-00282-6 

; Sequence 6, Application PC/TUS9200282 
; GENERAL INFORMATION: 

APPLICANT: OWENS, IDA S. 

APPLICANT: RITTER, JOSEPH K. 

TITLE OF INVENTION: THE GENETIC LOCUS UGT1 AND A MUTATION 
; TITLE OF INVENTION: THEREIN. 

NUMBER OF SEQUENCES: 40 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CUSHMAN DARBY & CUSHMAN 
; STREET: 1615 L STREET, N-W. 

; CITY: WASHINGTON 

STATE: D.C. 
COUNTRY: U.S.A. 
ZIP: 20036-5601 
.;. COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/ 00282 
FILING DATE: 19920110 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: SCOTT, WATSON T. 
; REGISTRATION NUMBER: 26581 



; REFERENCE/ DOCKET NUMBER: 91532-PCT 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 202-861-3000 
; TELEFAX: 202-822-0944 

; TELEX: 6714627 CUSH 

INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 531 amino acids 

TYPE: AMINO ACID 
; STRANDEDNESS : single 

; TOPOLOGY: linear 

MOLECULE TYPE: protein 
PCT-US92-00282-6 



Query Match 23.8%; Score 658; DB 5; Length 531; 

Best Local Similarity 30.9%; Pred. No. 4.2e-61; 

Matches 171; Conservative 106; Mismatches 211; Indels 66; Gaps 17; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

: | |||: || : I I : I : : I I I : I I I : I II : I 
Db 7 LRGLSGLLLLLCALP WAEGGKVL-VFPMEGSHWLSMRDVVRELHARGHQAVVL 58 

Qy 61 GPFMPDFKKEEKSYQVI SWLAP EDHQREF KKSFD FFLEETLGGRGK 106 

| : | | : : : : | I : : I I I III: MM: I 

Db 59 APEVTVHMKGEDFFTLQTYAFPYTKEEYQREILGNAKKGFEPQHFVKTFF — ETMASIKK 116 

Qy 107 FENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAIL .166 

| : | I I : I : I : : I : : I I : I : : I MM I II I 

Db 117 FFDL YANSCAALLHNKTLIQQLNSSSFDWLTDPVFPCGALLAKYLQIPAVFFL 170 

Qy 167 STSFGSLEF GLPI PLS YVPVFRSLLTDHMDFWGRVKNFLMFFS FCRRQQHMQST 220 

: : : : I I I M I : : I : M I I I I I I I : I Ml 
Db 171 RSVPCGIDYEATQCPKPSSYIPNLLTMLSDHMTFLQRVKNMLYPLTLKYIC HLSIT 226 

Qy 221 FDNTIKEHFTEGSR PVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIK 277 

: : : MM I : I I I I M I M : I I M : I M ' I 

Db 227 PYESLASELLQREMSLVEVLSH ASVWLFRGDFVFDYPRPIMPNMVFI GGINCVI KK 282 

Qy 278 PVPQDLENFIAKFGDSGFVXWLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWP 337 

|: |: I : : M I M : I M I M : M I : I I : : M : ■ 
Db 283 PLSQEFEAYVNASGEHGI WFSLGSMVSEI PEKKAM-EIAEALGRI PQTLLWRYTGT 338 

Qy 338 KDVliLAANVTCI VT)WLPQSDLLAHPSIRLF 397 

: : I I I : I M I M II I II I I : I I I : I I I : I I I I I : I I I M I M 
Db 339 RPSNLAKNTILVKWLPQNDLLGHPKARAFITHSGSHGIYEGICNGVPMVMMPLFGDQMDN 398 

Qy 398 MV^V^^KFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVG 457 

|:|: | | : : : : : I : I : I : : : I I I : I : : Ml I 
Db 399 AKRMETRGAGWLNVLEMTADDLENALKTVINNKSYKENIMRLSSLHKDRPIEPLDLAVF 458 

Qy 458 WIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL CGKLLGM 509 

|: : : I : : I I I M I M : : II III s I = : : II I 

Db 459 WV^YVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLIAIVXTWFIVTKSCAYGCRKCFG- 517 

Qy 510 AVWWLRGARKVKET 523 

I : I M : 

Db 518 GKGRVKKS 525 



RESULT 8 
US-09-180-852-2 

; Sequence 2, Application US/09180852 

; Patent No. 6287834 

; GENERAL INFORMATION: 

; APPLICANT: BELANGER, Alain 

; APPLICANT: HUM, Dean W. 

; APPLICANT: BEAULIEU, Martin 

; APPLICANT: LEVESQUE, Eric 

; TITLE OF INVENTION: CHARACTERI Z AT ION AND USE OF AN ISOLATED URIDINE 
; TITLE OF INVENTION: DI PHOSPHO-GLUCURONOSYLTRANSFERASE 

FILE REFERENCE: 1259-449 
; CURRENT APPLICATION NUMBER: US/09/180,852 
; CURRENT FILING DATE: 1999-02-08 
; EARLIER APPLICATION NUMBER: PCT/CA97/00328 
; EARLIER FILING DATE: 1997-05-16 
; EARLIER APPLICATION NUMBER: US 08/649, 319 
; EARLIER FILING DATE: 1996-05-17 
; NUMBER OF SEQ ID NOS : 2 

SOFTWARE: Patent In Ver. 2.0 
; SEQ ID NO 2 

LENGTH: 530 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-180-852-2 

Query Match 23.7%; Score 657; DB 3; Length 530; 

Best Local Similarity 32.6%; Pred. No. 5.3e-61; 

Matches 159; Conservative 92; Mismatches 211; Indels 26; Gaps 11; 



Qy 


34 


SHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVI-SWLAPEDHQREFKKS 

I I : : 1 : : 1 1 1 1 : 1 : 1 : 1 : 1 1 : 1 1 
SHWINMKT I LEELVQRGHEVI VLTS S AS I LVNAS KS SAI KLEVYPT S LTKNDLEDFFMKM 


92 


Db 


34 


93 


Qy 


93 


FDFFLEETLGGRGKFENLLNVLEYLALQCSHF LNRKDIMDSLKNENFDMVI 

||: : | : : | : | : | : I I : I : 1 1 : 1 1 : : : 
FDRWTYSI — SKNTFWSYFSQLQELCWEYSDYNIKLCEDAVLNKK-LMRKLQESKFDVLL 


143 


Db 


94 


150 


Qy 


144 


VETFDYCPFLIAEKLGKPFVAILSTSFG-SLE FGLPI PLSYVPVFRSLLTDHMDFWG 

: : | |:|||'||: 1 1 1 : : 1 1 1 1 1 1 1 1 1 1 : 1 1 1 
ADAVNPCGELLAELLNIPFLYSLRFSVGYTVEKNGGGFLFPPSYVPWMSELSDQMIFME 


199 


Db 


151 


210 


Qy 


200 


RVKN — FLMFFS FCRRQQHMQSTFDNTI KEHFTEGSRPVLSHLLLKAELWFINSDFAFDF 


257 


Db 


211 


| : | I : : : : | | : : : : 1 1 1 1 : | | | : | | :, t : I : I 
RIKNMIYMLYFDFWFQAYDLKK-WDQFYSE— VLGRPTTLFETMGKAEMWLIRTYWDFEF 


267 


Qy 


258 


ARPLLPNTWVGGLMEKPIKPVPQDLENFIAKFGDSGFVXVTLGSMVNTCQNPEIFKEMN 

I I 1 I 1 : I I II 1 1 1 1 : 1 ::: 1 1 : 1 : : 1 1 : : 1 1 1 1 :.: : 1 
PRPFLPNVDFVGGLHCKPAKPLPKEMEEFVQSSGENGIWFSLGSMISN-MSEESANMIA 


317 


Db 


268 


326 


Qy 


318 


NAFAHLPQGVIWKCQCSHWPKDvliLAANvT<IVT)WLPQSDLLAHPSIRLBVr^ 

: | | : | | I : I : 1 1 : 1 : : 1 1 1 1 : 1 1 1 1 1 : 1 : 1 1 1 1 1 1 1 
SALAQI PQKVLWRFD GKKPNTLGSNTRLYKWLPQNDLLGHPKTKAFITHGGTNGIYE 


377 


Db 


327 


383 


Qy 


378 


AIQHGVPMVGIPLFGDQPENMWVEIAKKFGVSIQLKKLKAETLAIjKMKQIMEDKRY 


437 



II I I : | I I I I I I I I I : h : : I I : I : : : : : I : I : : I II 
Db 384 AI YHGI PMVGI PLFADQHDNI AHMKAKGAALSVDI RTMS SRDLLNALKSVINDPI YKENI 443 

Qy 438 VAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTL 497 

: I I I : I I I I I : I : : I I I I : I :: I I I I I 

Db 444 MKLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAAHNLTWIQYHSLDVIAFLLACVA. 503 

Qy 498 GTLWLCGK 505 

::: I 

Db 504 TMIFMITK 511 



RESULT 9 

US-09-356-806-40 

; Sequence 40, Application US/09356806 

; Patent No. 6586175 

; GENERAL INFORMATION: 

; APPLICANT: Penny, Laura 

; APPLICANT: Galvin, Margaret 

; APPLICANT: Miller, Andrew 

; APPLICANT: Reidy, Michael 

; TITLE OF INVENTION: Genotyping Human 

; TITLE OF INVENTION: UDP-Glucuronosyltransf erase 2B4 (UGT2B4), 2B7 (UGT2B7) 
and 

; TITLE OF INVENTION: 2B15 (UGT2B15) Genes 
; FILE REFERENCE: SEQ-22PRV2 

; CURRENT APPLICATION NUMBER: US/09/356,806 
; CURRENT FILING DATE: 1999-07-20 
; NUMBER OF SEQ .ID NOS : 164 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 40 

LENGTH: 524 

TYPE: PRT 
; ORGANISM: H. sapiens 
US-09-356-806-40 

Query Match 23.5%; Score 651.5; DB 4; Length 524; 

Best Local Similarity 31.5%; Pred. No. 2e-60; 

Matches 163; Conservative 87; Mismatches 212; Indels 55; Gaps 14; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRG-PFMPD FKKEEKSY Q 75 

||:::: I I I I I : I I I : I I : : : I 

Db 34 SHWMNIKTILDELIQRGHEVTVLASSASILFDPNNSSALKIEIYPTSLTKTELENFIMQQ 93 

Qy 76 VTSWL-APEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ-CSHFLNRKDIMDS 133 

: | | : | - ,.: I : : : : : : : : : I : : I I 

Db 94 IKRWSDLPKD TFWLYFSQV QEIMSIFGDITRKFCKDWSNKKFMKK 139 

Qy 134 LKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTS FGSLEFGLPIPLSYVPVFRS 189 

:: I I :: : I I : I I III III I I I I I I I I I 

Db 140 VQESRFDVIFADAIFPCSELLAELFNIPFVYSLSFSPGYTFEKHSGGFIFPPSYVPWMS 199 

Qy 190 LLTDHMDFWGRVKN— FLMFFSFCRRQQHMQSTFDNTIKEHFTE GSRPVLSHLLLKA 244 

I | | I I I I I I :::: I I II : I I I I : I I 

Db 200 ELTDQMTFMERVKNMIYVLYFDF WFEIFDMKKWDQFYSEVLGRPTTLSETMGKA 253 



Qy 



245 ELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV 304 



: : I I : : I I I I I I I : I I I I II I I : I : : : I : I : I :: I I : : I I I I I 
Db 254 DVWLIRNSWNFQFPYPLLPNVDFVGGLHCKPAKPLPKEMEDFVQSSGENGVWFSLGSMV, 313 

Qy 305 NTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANWIVDWLP 364 

: I : : I I : I I I : I : : I I I : : I : I I : I I I I I I 
Db 314 SN-MTEERANVIASALAQI PQKVLWRFDGN KPDTLGLNTRLYKWI PQNDLLGHPKTR 369 

Qy 365 LFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVTWEAKKFGVS IQLKKLKAETLALKM 424 

I : I I I I I I I I I I I : I I I I I I I I I I I : I : : : I : I : : : I : 
Db 370 AFITHGGANGIYEAIYHGIPMVGIPLFADQPDNIAHMKARGAAVRVDFNTMSSTDLLNAL 429 

Qy 425 KQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVTQTGGATHLKPYVFQQPWHEQY 484 

I : : : I I I : I I I : I I I I I : I : : I I I I : I = : 

Db 430 KRVI N D P S Y KENVMKL SRI QH DQ P VK P L D RAVFW I E FVMRH KGAKH L RVAAH D LT W FQ YH 489 

Qy 4 85 LFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVK 521 

II III : : : I I : I I I I 

Db 490 SLDVIGFLLVCVATVIFIVTKCCLFCFW— KFARKAK 524 



RESULT 10 
. US-09-949-016-8605 

; Sequence 8605, Application US/09949016 
; Patent No. 6812339 
; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al . 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 

THEREOF 

; FILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/949, 016 

; CURRENT FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS : 207012 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 8605 

; LENGTH: 439 

; TYPE: PRT 

ORGANISM: Human 

FEATURE: 

NAME/ KEY: VARIANT 
; LOCATION: (1) . . . (439) 

OTHER INFORMATION: Xaa = Any Amino Acid 
US-09-949-016-8605 

Query Match 23.4%; Score 649; DB 4; Length 439; 

Best Local Similarity 34.4%; Pred. No. 2.8e-60; 

Matches 147; Conservative 74; Mismatches 184; Indels 22; Gaps 8; 

Qy 106 KFENLLNVLEYLALQ-CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVA 164 

I : : : : : : : I : : I I : : II::: I I : I I III 
Db 21 KYREIMS I FGDITRKFCKDWSNKKFMKKVQESRFDVI FADAI FPCSELLAELFNI PFVY 80 



165 ILSTS — --FGSLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNt-FLMFFSFCRRQQHMQ 218 

III I I I Mill I III I I I I I I I 
81 SLSFSPGYTFEKHSGGFIFPPSYVPVVMSELTDQMTFMERVKNMIYVLYFDF WF 134 

219 STFDNTIKEHFTE GSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKP 275 

II : I I II : | | : : I | : : I I I I I I I : I I I I II 

135 EIFDMKKWDQFYSEVLGRPTTLSETMGKADVWLIRNSWNFQFPYPLLPNVDFVGGLHCKP 194 

276 IKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSH 335 

I I : I : : : I : I : I : : I I : : I I I I I : I : : I I : I I I : I : : 

195 AKPLPKEMEDFVQSSGENGVWFSLGS1WSN-MTEERAWIASALAQIPQKVLWR 252 

336 ' WPKDVHLAANVKI VDWLPQSDLLAHPS I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQP 395 

I I I : : I : I I : I I I II I I : I I I I I I III I I : I I I I I I I I III 
253 — KPDTLGLNTRLYKWIPQNDLLGHPKTRAFITHGGANGIYEAIYHGIPMVGIPLFADQP 310 

396 ENMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRL 455 

: I : : : I : I : : : I : I : : : I II : II I : I I 

311 DNIAHMKARGAAVRVDFNTMSSTDLLNALKRVINDPSYKE^rVMKLSRIQHDQPVKPLDRA 370 

456 VGWIDHVTiQTGGATHLKPWFQQPWHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLR 515 

I I I : I : : I I I I : I : : I I I I I : : : I I : 

371 VFWIEFVMRHKGAKHLRVAAHDLTWFQYHSLDVIGFLLVCVATVIFIVTKCCLFCFW — K 428 

516 GARKVKE 522 

III I : 
429 FARKAKK 435 



RESULT 11 
US-09-356-806-113 

; Sequence 113, Application US/09356806 
; Patent No. 6586175 
; GENERAL INFORMATION: 
; APPLICANT: Penny, Laura 
; APPLICANT: Galvin, Margaret 
; APPLICANT: Miller, Andrew 
APPLICANT: Reidy, Michael 
; TITLE OF INVENTION: Genotyping Human 

; TITLE OF INVENTION: UDP-Glucuronosyltrans f erase 2B4 (UGT2B4 ) , 2B7 (UGT2B7) 
and 

; TITLE OF INVENTION: 2B15 (UGT2B15) Genes 
; FILE REFERENCE: SEQ-22PRV2 

; CURRENT APPLICATION NUMBER: US/09/356,806 
; CURRENT FILING DATE: 1999-07-20 
; NUMBER OF SEQ ID NOS : 164 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 113 

LENGTH: 530 

TYPE: PRT 
; ORGANISM: H. sapiens 
US-09-356-806-113 

Query Match 23.4%; Score 649; DB 4; Length 530; 

Best Local Similarity 31.8%; Pred. No. 3.8e-60; 

Matches 155; Conservative 90; Mismatches 218; Indels 24; Gaps 10; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



Qy 34 SHYLLMDRVSQI LQDHGHNVTMLNHKRGPFMPDFKKEEKS YQVI - SWLAPEDHQREFKKS 92 

I I : : I : : I I I I I : I : I : I : I I : I 

Db 34 SHWINMKTILEELVQRGHEVTVLTSSASTLVNASKSSAIKLEVYPTSLTKNDLEDSLLKI 93 

Qy 93 FDFFLEETLGGRGKFENLLNVLEYLALQ CSHFLNRKDIMDSLKNENFDMVIV 144 

I : : : | : : | : | : I : | : | | : I I : : : 

Db 94 LDRWIYGV— SKNTFWSYFSQLQELCWEYYDYSNKLCKDAVLNKKLMMKLQESKFDVILA 151 

Qy 145 ETFDYCPFLIAEKLGKPFVAILSTSFG-SLE FGLPI PLSYVPVFRSLLTDHMDFWGR 200 

: : I I : I I I I : I I I : I I I II I I I I hi I I I 

Db 152 DALNPCGELIAELFNIPFLYSLRFSVGYTFEKNGGGFLFPPSYVPVVMSELSDQMIFMER 211 

Qy 201 VKNF — LMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFA 258 

: I I : : : I I : : : : I I I I : I I I : I I : : I : I 

Db 212 I KNMIHMLYFDFWFQI YDLKK-WDQFYS E — VLGRPTTLFETMGKAEMWLI RT YWDFEFP 268 

Qy 259 RPLLPNTVWGGIJyiEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNN 318 

I I I I I : I I I I I I I I : I : : : I I : I : : I I : : I I I I : : : I : : 
Db 2 69 RPFLPNVDFVGGLHCKPAKPLPKEMEEFVQSSGENGIWFSLGSMISN-MSEESANMIAS 327 

Qy 319 AFAHLPQGVIWKCQCSHWPKDVHLAANWIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEA 378 

I I : I I I : I : I I : I : : I I I I : I I I I I : I : I I I I I I I I 

Db 328 ALAQI PQKVLWRFD GKKPNTLGSNTRLYKWLPQNDLLGHPKTKAFITHGGTNGIYEA 384 

Qy 379 IQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAV 438 

I I I : I I I I I I I I I I : I : : : I I : I : : : : : I : I : : I II : 
Db 385 IYHGIPMVGIPLFADQHDNIAHMKAKGAALSVDIRTMSSRDLLNALKSVINDPVYKENVM 444 

Qy 439 AASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLG 498 

II I : I I I I I : I : : I I II : I :: I I I I I 

Db 445 KLSRIHHDQPMKPLDRAVFWIEFVMRHKGAKHLRVAAHNLTWIQYHSLDVIAFLLACVAT 504 

Qy 4 99 TLWLCGK 505 

... I 
... i 

Db 505 VIFIITK 511 



RESULT 12 

US-09-949-016-6999 

Sequence 6999, Application US/09949016 
Patent No. 6812339 
GENERAL INFORMATION: 
APPLICANT: VENTER, J. Craig et al . 

TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 
TITLE. OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 
THEREOF 

FILE REFERENCE: CL001307 

CURRENT APPLICATION NUMBER: US/09/949,016 
CURRENT FILING DATE: 2000-04-14 
PRIOR APPLICATION NUMBER: 60/241,755 
PRIOR FILING DATE: 2000-10-20 
PRIOR APPLICATION NUMBER: 60/237,768 
PRIOR FILING DATE: 2000-10-03 
PRIOR APPLICATION NUMBER: 60/231,498 
PRIOR FILING DATE: 2000-09-08 
NUMBER OF SEQ ID NOS : 207 012 



SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 6999 
LENGTH: 528 
TYPE: PRT 
ORGANISM: Human 
US-09-949-016-6999. 

Query Match 22.9%; Score 632.5; DB 4; Length 528; 

Best Local Similarity 33.4%; Pred. No. 2.2e-58; 

Matches 169; Conservative 88; Mismatches 218; Indels 31; Gaps 15; 

Qy 34 SHYLLMDRVSQI LQDHGHNVTMLNHKRG PFMPDFKKEEKSYQVI SWLAPEDHQREF 89 

II:::: I I I I I : I II II : : I I : : 

Db 34 SHWMNI KTI LDELVQRGHEVTVLAS SAS I S FDPNS PSTLKFEVYPVSLTKTEFEDI I KQL 93 

Qy 90 KKS FDFFLEETLGGRGKFENLLNVL EYLALQCSHFLNRKDIMDSLKNENFDMVIVE 145 

I : : : I | : : : : I I : : I : I I : I I : I : : 

Db 94 VKRWAELPKDTFW — SYFSQVQEIMWTFNDILRKFCKDIVSNKKLMKKLQESRFDWLAD 151 

Qy 146 TFDYCPF — LIAEKLGKPFVAILSTSFG-SLE FGLPI PLSYVPVFRSLLTDHMDFWG 199 

111:111111 I I : : I II I Mill I hi I I 

Db 152 A— VFPFGELLAELLKIPFVYRPRFSPGYAIEKHSGGLLFPPSYVPWMSELSDQMTFIE 209 

Qy 200 RVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDF 257 

I I I I : : : : I I : I : : I I I II : I I : : I I : : I I 
Db 210 RVKNMIYVLYFEFWFQIFDMKK-WDQFYSE — VLGRPTTLSETMAKADIWLIRNYWDFQF 266 

Qy 258 ARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV-NTCQNPEIFKEM 316 

I I I II : I I I I | | | | : I : : : I I : I : : I I : : I I I I I I I : I : 
Db 267 PHPLLPNVEFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLGSMVSNTSE — ERANVI 324 

Qy 317 NNAFAHLPQGVIWKCQCSHWPKDVliLAANvT<IVT)WLPQSDLLAHPSIRLFW 376 

: I I : I I I : I : : I I I : : I : I I : I I I I I I I : I I I I I I 
Db 325 ASALAKI PQKVLWRFDGN KPDTLGLNTRLYKWIPQNDLLGHPKTRAFITHGGANGIY 381 

Qy 377 EAIQHGVPMVGIPLFGDQPENMVT^VTIAKKFGVSIQLKKLKAETLALKMK^ 436 

: I I : I I I I : I I I I I I : I : :: I I I I : : : I : | : : I II 
Db 382 KAISPRI PMVGVPLFADQPDNIAHMKAKGAAVSLDFHTMSSTDLLNALKTVINDPLYKEN 441 

Qy 437 AVAASVILRSHPLSPTQRLVGWIDHVXQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLT 4 96 

I : II I : I I I I I : I : : I I I I : I : ' : I I I I I 

Db 442 AMKLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAAHDLTWFQYHSLDVTGFLLACV 501 

Qy 497 LGTLWLCGKLLGMAVW-WLRGARKVK 521 

: : : I I I I : : I : I I 
Db 502. ATVIFIITKCL-FCVWKFVRTGKKGK 526 



RESULT 13 
PCT-US92-00282-5 

; Sequence 5, Application PC/TUS9200282 
; GENERAL INFORMATION: 

APPLICANT: OWENS , IDA S. 

APPLICANT: RITTER, JOSEPH K. 

TITLE OF INVENTION: THE GENETIC LOCUS UGT1 AND A MUTATION 
TITLE OF INVENTION: THEREIN. 
NUMBER OF SEQUENCES: 40 



CORRESPONDENCE ADDRESS: 

ADDRESSEE: CUSHMAN DARBY & CUSHMAN 

STREET: 1615 L STREET, N.W. 

CITY: WASHINGTON 

STATE: D.C. 

COUNTRY: U.S.A. 

ZIP: 20036-5601 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/00282 

FILING DATE: 19920110 

CLASSIFICATION: 435 
ATTORNEY/ AGENT INFORMATION: 
; NAME: SCOTT, WATSON T. 

; REGISTRATION NUMBER: 26581 

REFERENCE/ DOCKET NUMBER: 91532-PCT 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-861-3000 

TELEFAX: 202-822-0944 

TELEX: 6714627 CUSH 
; INFORMATION FOR SEQ ID NO: 5: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 531 amino acids 

TYPE: AMINO ACID 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
PCT-US92-00282-5 

Query Match 22.7%; Score 629; DB 5; Length 531; 

Best Local Similarity 30.4%; Pred. No. 5.3e-58; 

Matches 159; Conservative 100; Mismatches 218; Indels 46; Gaps 13; 

Qy 4 QRVLLLVGFL-LPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGP 62 

II: | | | | | : : : : | : | : I | | : | | : : : I I I I : : : 
Db 9 QRISAGVFFLALWGMWGD--KLLWPQ-DGSHWLSMKDIVEVLSDRGHEIVVV 59 

Qy 63 FMPDFK KEEKS YQVI SWLAPEDHQREFKKS FDFFLEETLGGRG KFENLLN 112 

: I : I I I I : I I I I I : I I : : I : 

Db 60 -VPEVNLLLKEYKYYTRKIYPVPYD-QEELKNRYQSFGNNHFAERSFLTAPQTEYRNNMI 117 

Qy 113 VLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGS 122.-. 

I : : I | : | : : I I I : : I : : I I I I I I : I 

Db 118 VIGLYFINCQSLLQDRDTLNFFKESKFDALFTDPALPCGVILAEYLGLPSVYLFRGFPCS 177 

Qy 173 LEFGL PI PLS YVPVFRSLLTDHMDFWGRVKNFL MFFSFCRRQQHMQSTFD 222 

II I I : I I : I :: I I I I I I I I I : I : : : : I 

Db 178 LEHTFSRSPDPVSYIPRCYTKFSDHMTFSQRVANFLVNLLEPYLFYCLFSKYEKLASA — 235 

Qy 223 NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFT^RPLLPNTVYVGGLMEKPIKPVPQD 282 

: I II I : : I : II : : I I : : I I I : : I I : I I : I : 

Db 236 -VLK RDVDIITLSEVSVWLLRYDFVXEYPRPVMPNMVFIGGINCKKRKDLSQE 287 



Qy 283 LENFIAKFGDSGFVLVTLGS1WNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHL 342 

.|:| I : I I : : I I I I I : : : -: I I I I : I : : : : I 
Db 288 FEAYINASGEHGIWFSLGSMVSEIPEKKAM-AIADALGKNPQTVLWRYTGT RPSNL 343 

Qy 343 AANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEA 402 

II : I I I I I : I I I I I I I : I I I : : I : I : II I I I : I I I I II : I hi 
Db 344 ANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRME 403 

Qy 403 AKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHV 462 

I | | :::::: | | : I : : I I I I : I : : I : I I I : : I 

Db 404 TKGAGWLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFV 463 

Qy 463 LQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

: : I I I I : I I : : : I I I I I : I : : I 

Db 464 MRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAWLTVAFITFK 506 



RESULT 14 
PCT-US92-00282-4 

; Sequence 4, Application PC/TUS9200282 

; GENERAL INFORMATION: 

; APPLICANT: OWENS, IDA S. 

; - APPLICANT: RITTER, JOSEPH K. 

TITLE OF INVENTION: THE GENETIC LOCUS UGT1 AND A MUTATION 
TITLE OF INVENTION: THEREIN. 
NUMBER OF SEQUENCES: 40 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CUSHMAN DARBY & CUSHMAN 
; STREET: 1615 L STREET, N.W. 

CITY: WASHINGTON 
STATE : D . C . 
COUNTRY: U.S.A. 
ZIP: 20036-5601 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/00282 
FILING DATE: 19920110 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: SCOTT, WATSON T. 
REGISTRATION NUMBER: 26581 
REFERENCE/ DOCKET NUMBER: 91532-PCT 
; TELECOMMUNICATION INFORMATION: 
TELEPHONE: 202-861-3000 
TELEFAX: 202-822-0944 
TELEX: 6714627 CUSH 
; INFORMATION FOR SEQ ID NO: 4: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 534 amino acids 

TYPE: AMINO ACID 
; STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 



PCT-US92-00282-4 



Query Match 21.9%; Score 606; DB 5; Length 534; 

Best Local Similarity 31.3%; Pred. No. 1.5e-55; 

Matches 163; Conservative 94; Mismatches 218; Indels 46; Gaps 13; 

Qy 8 LLVGFLLPGVLLS EAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGP 62 

I I I I III | : | : | : | I I : I I : I II : I I 
Db 12 LATGLLL LLSVQPWAESGKVXWPT-DGSPWLSMREALRELHARGHQAVVLT P 63 

Qy 63 FMPDFKKEEKSYQVI SWLAPEDHQREFKKSFDFFLEETLG GRGKFENLL 111 

: | | | | : : : : | | : | | : Ml I : I : I I 

Db 64 EVNMHIKEEKFFTLTAYAVPWT-QKEFDR VTLGYTQGFFETEHLLKRYSRSMA 115 

Qy 112 NVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILST 168 

II I I | : : : : I : I I : I : : : I : : I : I I I 
Db 116 IMNNVSLALHRCCVELLHNEALIRHLNATSFDWLTDPWLCGAVLAKYLSIPAVFFWRY 175 

Qy 169 SFGSLEF GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTI 225 

I : I lllhl :: I I I I I I I I I : : : : 

Db 176 IPCDLDFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLA 235 

Qy 226 KEHFTEGSRPV-LSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLE 284 

II II: I : I : I II I : I I : : I I I : : I I : I I : I : I 

Db 236 SELF QREVSWDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFE 292 

Qy 285 NFIAKFGDSGFVXWLGSMVNTCQNPEIFKEMNN 344 

: I I : I I : : I I I I I : : : : I : I I I : I : : : : I I 
Db 293 AYINASGEHGIWFSLGSMVSEIPEKKAM-AIADALGKIPQTVLWRYTGT RPSNLAN 348 

Qy 345 NVT<IVI>WLPQSDLLAHPSIRLFvTHGGQNSIMEAIQHGVPMVGIPLFGDQPENF^^ 404 

I : I I I I I : I I I I I I I : I I I : : I : I : I I I I I : I I I I I I : I hi I 
Db 349 NTILV1WLPQNDLLGHPMTRAFITHAGSHGWESICNGVPMVMMPLFGDQMDNAKRMETK 408 

Qy 4 05 KFGVS IQLKKLKAETLALKMKQ1MEDKRYKSAAVAAS VI LRSHPLS PTQRLVGWI DHVLQ 4 64 

I I :::::: I I : I : : I I I I : I : : I : I I I : : I : : 

Db 4 09 GAGVTLNVXEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRP 4 68 

Qy 465 TGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

II I I : I I : : : II I I I : I : : I 

Db 4 69 HKGAPHLRPAAHDLTWYQYHSLDVIGFLLAWLTVAFITFK 509 



RESULT 15 
PCT-US92-00282-7 

Sequence 7, Application PC/TUS9200282 

GENERAL INFORMATION: 

APPLICANT: OWENS, IDA S. 
APPLICANT: RITTER, JOSEPH K. 

TITLE OF INVENTION: THE GENETIC LOCUS UGT1 AND A MUTATION 
TITLE OF INVENTION: THEREIN. 
NUMBER OF SEQUENCES: 4 0 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CUSHMAN DARBY & CUSHMAN 
STREET: 1615 L STREET, N.W. 
CITY: WASHINGTON 
STATE: D.C. 



COUNTRY: U.S.A. 

ZIP: 20036-5601 
COMPUTER READABLE FORM: 

MEDIUM TYPE:" Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/00282 

FILING DATE: 19920110 

CLASSIFICATION: 435 
; ATTORNEY/AGENT INFORMATION: 
; NAME: SCOTT, WATSON T. 

; REGISTRATION NUMBER: 26581. 

REFERENCE/DOCKET NUMBER: 91532-PCT 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-861-3000 

TELEFAX: 202-822-0944 

TELEX: 6714 627 CUSH 
; INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 529 amino acids 
; TYPE: AMINO ACID 

; STRANDEDNESS: single 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 
PCT-US92-00282-7 

Query Match 21.7%; Score 600; DB 5; Length 529; 

Best .Local Similarity 28.3%; Pred. No. 6.6e-55; 

Matches 155; Conservative 105; Mismatches 222; Indels 66; Gaps 14; 

Qy 8 LLVGFL LPGVLLSEAAKI LTI STVGGSHYLLMDRVSQI LQDHGHNVTMLNHKRGPFM 64 

I I I I I I : I : I : I : I I I : I I : : I : I I : : : I : 
Db 10 LPAGFLFLVLWGSVLGD — KLLWPQ-DGSHWLSMKEIVEHLSERGHDIWL V 59 

Qy 65 PDFK KEEKS YQVISWLAP EDHQREFKKSFDFFLEETLGGRGKFENLLNVLE 115 

I : I I I : I : I I I : : : : I : I : : 

Db 60 PEVNLLLGES KYYRRKS FPVP YNLEELRTRYRS FGNNH FAAS S PLMAPLRE YRNNMI VI D 119 

Qy 116 YLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEF 175 

II : | : | | : : | : : | I | I : : III 

Db 120 MCFFSCQSLLKDSATLSFLRENQFDALFTDPAMPCGVILAEYLKLPSIYLFRGFPCSLEH 179 

Qy 176 — GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGS 233 

11:11111: : I I I I I : I I : - ■ .: : I : I 

Db 180 IGQSPSPVSYVPRFYTKFSDHMTFPQRLANFI ANILENYL-YHCLYSK 226 

Qy 234 RPVLSHLLLKAE LWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDL 283 

: I : Ml: I I : I I I : : I I : : I I : : : I I I = I : 

Db 227 YEILASDLLKRDVSLPALHQNSLWLLRYDFVFEYPRPVMPNMIFIGGTNCKKKGNLSQEF 286 

Qy 284 ENFIAKFGDSGFVXWLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLA 343 

I : : i : I I : : I I II I : : I : I : I I : : I : : : : I I 

Db 287 EAYVNASGEHGIWFSLGSMVSEIPEKKAM-EIAEALGRIPQTLLWRYTGT — -RPSNLA 342 



Qy 



344 ANVKIVT)WLPQSDLIxAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEA 403 



I : I I I I I : I I I I I I I : I I I : I I I : I I I I I : I I I I I I : I I : I 
Db 343 KNTILVKWLPQNDLLGHPKARAFITHSGSHGIY 402 

Qy 404 KKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVL 463 

: | | : : : : : I : I : I : : : I I I : I : : hi I I : : : I : 
Db 403 RGAGVTLNVLEM'ADDLENALKTVINNKSYKENIMRLSSLHKDRPIEPLDLAVFWVEYVM 462 

Qy 464 QTGGATHLKPWFQQPWHEQYLFDVFVFLLGLTLGTLWL CGKLLGMAVWWLR 515 

: I I I I : I I : : : I I I I I : I : : : III 
Db 463 RHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAIVXTWFIVYKSCAYGCRKCFG 515 

Qy 516 GARKVKET 523 

I : I I : : 
Db 516 GKGRVKKS 523 



Search completed: February 15, 2005, 12:54:31 
Job time : 57 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 
Perfect score: 2768 



Sequence : 



February 15, 2005, 12:26:43' ; Search time 40 Seconds 

(without alignments) 
1258.034 Million cell updates/sec 

US-10-017-867A-282 



1 MAGQRVLLLVGFLLPGVLLS GKLLGMAVWWLRGARKVKET 523 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283416 seqs, 96216763 residues 

Total number of hits satisfying chosen parameters: 283416 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 1500 summaries 

Database : PIR_79:* 
1: pirl:* 
2: pir2:* 
3: pir3:* 
4: pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


700 


25' 


.3 


541 


2 


JC5423 


2-hydroxyacylsphin 


2 


684 


24 


.7 


541 


2 


A48801 


. . 2-hydroxyacylsphin 


3 


679 


24 


.5 


533 


2 


A39092 


glucuronosyltransf 


4 


674.5 


24 


.4 


530 


2 


A40467 


glucuronosyltransf 


5 


671 


24 


.2 


529 


2 


A42233 


glucuronosyltransf 


6 


67 0 


24 


.2 


531 


2 


B47113 


glucuronosyltransf 


7 


662.5 


23 


.9 


530 


2 


C47113 


glucuronosyltransf 


8 


658 


23 


.8 


531 


2 


A35343 


glucuronosyltransf 


9 


658 


23 


.8 


535 


2 


157961 


glucuronosyltransf 


10 


656.5 


23 


.7 


528 


2 


JN0619 


glucuronosyltransf 


11 


654.5 


23 


.6 


529 


2 


A35366 


glucuronosyltransf 


12 


650 


23 


.5 


527 


2 


S15089 


glucuronosyltransf 


13 


649 


23 


.4 


530 


2 


A48633 


glucuronosyltransf 



J 



14 


642 


23. 


2 


528 


2 


JN0620 


UDP-glucuronosyltr 


15 


640 


23. 


.1 


530 


2 


S07390 


glucuronosyltrans f 


16 


639 


23. 


1 


530 


2 


A36276 


glucuronosyl trans f 


17 


637.5 


23. 


0 


530 


2 


S00163 


glucuronosyltrans f 


18 


632.5 


22. 


9 


530 


2 


S68200 


glucuronosyltrans f 


19 


632 


22. 


8 


523 


2 


S11309 


glucuronosyltrans f 


20 


629 


22. 


7 


531 


2 


A31340 


glucuronosyltrans f 


21 


623 


22. 


5 


530 


2 


JC5656 


UDP glucuronosyltr 


22 


622.5 


22. 


5 


529 


2 


JE0200 


orphan UDP-glucuro 


23 


612.5 


22. 


1 


531 


2 


A55788 


glucuronosyltrans f 


24 


610 


22. 


0 


530 


2 


S17512 


glucuronosyltrans f 


25 


600 


21. 


7 


529 


2 


A24600 


glucuronosyltrans f 


26 


596.5 


21. 


5 


531 


2 


B55788 


glucuronosyltrans f 


27 


513 


18. 


5 


485 


2 


T13694 


glucuronosyltrans f 


28 


471 


17. 


0 


949 


2 


.T18591 


hypothetical 


prote 


29 


455.5 


16. 


5 


534 


2 


T34455 


hypothetical 


prote 


30 


449 


16. 


2 


531 


2 


T33980 


hypothetical 


prote 


31 


448 


16. 


2 


573 


2 


T27578 


hypothetical 


prote 


32 


447 


16. 


1 


531 


2 


T23835 


hypothetical 


prote 


33 


441.5 


16. 


0 


508 


2 


T03910 


UDP-glucuronosyltr 


34 


438 


15. 


8 


534 


2 


T19944 


hvr>o1~h f*t~ i 1 


prote 


35 


427 


15. 


4 


586 


2 


T19075 




prote 


36 


424 


15. 


3 


534 


2 


T19951 


h\7T50l~hf i i~ i ral 


prote 


37 


419 


15. 


1 


533 


2 


T34458 


\m o 1~ h ^ *t~ i r*^ 1 

11 y £s\s \— 1 1 v — L V — CI _1_ 


prote 


38 


412 . 5 


14 . 


9 


475 


2 


T33979 


h vr^ot* h e*t" "i ral 

ii y k-* w cii^ o _i_ v— * ci _i_ 


prote 


39 


412.5 


14 . 


9 


537 


2 


T25536 


11 V W Ull^ -L. V— -C* -L 


prote 


40 


.411 


14. 


8 


745 


2 


T23893 


hvrtothf^t" i ral 

11 V w L>11^ -1_ V_^C1 JL 


prote 


41 


409.5 


14. 


8 


661 


2 


T32518 


hvDothetical 


prote 


42 


408 


14. 


7 


533 


2 


T34457 


hypothetical 


prote 


43 


405.5 


14 . 


6 


520 


2 


T19661 


hypothetical 


prote 


44 


405 


14. 


6 


530 


2 


T18596 


hypothetical 


prote 


45 


398. 5 


14. 


4 


526 


2 


T25535 


hypothetical 


prote 


46 


398 


14. 


4 


534 


2 


T27588 


hypothetical 


prote 


47 


392.5 


14 . 


2 


529 


2 


T33738 


hypothetical 


prote 


48 


391.5 


14 . 


1 


530 


2 


T19365 


hypothetical 


prote 


49 


391 


14. 


1 


537 


2 


T21823 


hypothetical 


prote 


50 


389 


14 . 


1 


531 


2 


T15329 


hypothetical 


prote 


51 


387 . 5 


14. 


0 


504 


2 


T34106 


hypothetical 


prote 


52 


386.5 


14. 


0 


542 


2 


T20457 


hypothetical 


prote 


53 


384 


13. 


9 


417 


2 


T34459 


hypothetical 


prote 


54 


383. 5 


13. 


9 


567 


2 


T25771 


hypothetical 


prote 


55 


382.5 


13. 


8 


574 


2 


T19738 


hypothetical 


prote 


56 


380.5 


13. 


7 


535 


2 


T24309 


hypothetical 


prote 


57 


379.5 


13. 


7 


535 


2 


T27582 


hypothetical 


prote 


58 


377 


13. 


6 


540 


2 


T33982 


hypothetical 


prote 


59 


375. 5 


13. 


6 


525 


2 


T24647 


hypothetical 


prote 


60 


375 


13. 


5 


522 


2 


T31893 


hypothetical 


prote 


61 


374 


13. 


5 


526 


2 


T27581 


hypothetical 


prote 


62 


373 


13. 


5 


611 


2 


T03890 


hypothetical 


prote 


63 


364.5 


13. 


2 


1003 


2 


T19638 


hypothetical 


prote 


64 


362 


13. 


, 1 


529 


2 


T18595 


hypothetical 


prote 


65 


361.5 


13. 


.1 


533 


2 


T27589 


hypothetical 


prote 


66 


361.5 


13. 


1 


581 


2 


T33737 


hypothetical 


prote 


67 


361 


13. 


,0 


531 


2 


T25537 


hypothetical 


prote 


68 


360.5 


13. 


0 


531 


2 


T33981 


hypothetical 


prote 


69 


357 


12. 


.9 


174 


2 


A47113 


glucuronosyltrans f 


70 


354.5 


12. 


.8 


540 


2 


T20456 


hypothetical 


prote 



71 


349.5 


12. 


6 


502 


2 


T25263 


hypothetical prote 


72 


346.5 


12. 


5 


515 


. 2 


S52453 


, ecdysteroid UDP-gl 


73 


344 


12. 


4 


710 


2 


B89252 


protein ZC455.4 [i 


74 


339 


12. 


2 


558 


2 


T15939 


hypothetical prote 


75 


333 


12. 


0 


506 


2 


H88809 


protein R11A8.3 [i 


76 


328 


11. 


8 


560 


2 


T30475 


probable phenol be 


77 


326 


11. 


8 


515 


2 


T32217 


hypothetical prote 


78 


322 


11. 


6 


540 


2 


T15573 


hypothetical prote 


79 


317 


11. 


5 


514 


2 


T25534 


hypothetical prote 


80 


317 


11. 


5 


593 


2 


T32485 


hypothetical prote 


81 


309.5 


11. 


2 


579 


2 


H88632 


protein F56B3.7 [i 


82 


309 


11. 


2 


523 


2 


T28727 


hypothetical prote 


83 


305. 5 


11. 


0 


440 


2 


T24478 


hypothetical prote 


84 


300.5 


10. 


9 


542 


2 


T28719 


hypothetical prote 


85 


292 


10. 


5 


435 


2 


T24477 


hypothetical prote 


86 


288 . 5 


10. 


4 


572 


2 


T32303 


hypothetical prote 


87 


280 


10. 


1 


523 


2 


T28721 


hypothetical prote 


88 


274 . 5 


9. 


9 


580 


2 


T28725 


hypothetical prote 


89 


272 


9 . 


8 


534 


2 


T28720 


hypothetical prote 


90 


270 


9 . 


8 


405 


2 


G89075 


protein K04A8.10 [ 


91 


264 . 5 


9 . 


6 


501 


2 


T31754 


hypothetical prote 


92 


259 . 5 


9 . 


4 


489 


2 


T10283 


ecdysteroid UDPglu 


93 


257 . 5 


9 . 


3 


495 


2 


T28722 


hypothetical prote 


94 


257 . 5 


9. 


3 


703 


2 


S40710 


hypothetical prote 


95 


256 


9. 


2 


310 


2 


G02309 


UDP-glucuronosyltr 


96 


256 


9 . 


2 


506 


2 


T33770 


hypothetical prote 


97 


255 


9 . 


2 


289 


2 


B42586 


glucuronosyl trans f 


98 


248 


9. 


0 


506 


2 


T41760 


UDP-glucosyl trans 


99 


246 . 5 


8 . 


9 


506 


1 


XUNVAC 


ecdysteroid UDPglu 


100 


245 . 5 


8 . 


9 


457 


2 


H89251 


protein ZC455.6 [i 


101 


244 . 5 


8 . 


8 


289 


2 


D42586 


glucuronosyl trans f 


1 0? 

J. \J <L 


244 


8 . 


8 


581 


2 


B87768 


protein F54C1.1 [i 


103 


242 . 5 


8 . 


8 


795 


^ 2 


T20609 


hypothetical prote 


104 


226 


8 . 


2 


289 


2 


C42586 


glucuronosyl trans f 


105 


224 . 5 


8 . 


1 


285 


2 


G02240 


UDP-glucuronosyltr 


106 


197 . 5 


7 . 


1 


488 


2 


T49903 


glucosyl trans f eras 


107 


186 


6. 


7 


419 


1 


S74500 


zeaxanthin glucosy 


108 


185.5 


6. 


7 


484 


2 


D71419 


probable indole— 3— 


109 


185 


6. 


7 


433 


2 


S51767 


glycosyl transfera 


110 


184 .5 


6. 


7 


453 


2 


A86191 


hypothetical prote 


111 


182 


6. 


6 


413 


2 


S52980 


zeaxanthin glucosy 


112 


181. 5 


6. 


6 


449 


2 


H84870 


probable glucosylt 


113 


181. 5 


6. 


6 


479 


2 


E71419 


probable indole-3- 


114 


180 . 5 


6. 


5 


260 


2 


T31894 


hypothetical prote 


115 


178 


6. 


4 


428 


2 


C70670 


probable glycosyl- 


116 


177 


6. 


.4 


457 


2 


B84725 


probable glucosylt 


117 


176. 5 


6. 


4 


1184 


2 


H86190 


hypothetical prote 


118 


175.5 


6. 


3 


480 


2 


B85014 


probable flavonol 


119 


173 . 5 


6. 


3 


431 


2 


B37802 


crtX protein - Erw 


120 


172 


6. 


2 


476 


2 


H86207 


hypothetical prote 


121 


171 


6. 


2 


452 


2 


T00981 


probable glucosylt 


122 


171 


6. 


2 


4 67 


2 


H84699 


probable flavonol 


123 


169.5 


6. 


, 1 


459 


2 


T51558 


probable flavonol 


124 


169.5 


6. 


, 1 


487 


2 


C86356 


UDP-glucose glucos 


125 


169 


6. 


.1 


456 


2 


T00506 


indole-3-acetate b 


126 


168.5 


6. 


,1 


456 


2 


F84724 


probable glucosylt 


127 


167.5 


6. 


.1 


475 


2 


F71419 


probable indole-3- 



128 


166 


6.0 


460 


2 


T00639 


hypothetical prote 


129 


165.5 . 


6.0 


407 


2 


T37104 


probable glycosyl 


130 


165.5 


6.0 


462 


2 


T01732 


UTP-glucose glucos 


131 


164.5 


5.9 


449 


2 


B84871 


probable glucosylt 


132 


164.5 


5.9 


472 


2 


T51559 


probable flavonol 


133 


164 


5.9 


471 


2 


A54739 


indole-3-acetate b 


134 


164 


5.9 


478 


2 


T08395 


UTP-glucose glucos 


135 


163.5 


5.9 


457 


2 


C85434 


glucosyltransf eras 


136 


163.5 


5.9 


471 


2 


S01052 


flavonol 3-O-gluco 


137 


162.5 


5.9 


420 


2 


T08005 


flavonol 3-O-gluco 


138 


162.5 


5.9 


453 


2 


T45603 


glucosyltransf eras 


139 


162 


5.9 


464 


2 


T47710 


glucuronosyl trans 


140 


161 


5.8 


474 


2 


A84700 


probable flavonol 


141 


161 


5.8 


1198 


2 


S51434 


hypothetical prote 


142 


159.5 


5.8 


478 


2 


A71417 


hypothetical prote 


143 


159 


5.7 


287 


2 


S41953 


UTP-glucose glucos 


144 


159 


5.7 


460 


2 


T51560 


probable flavonol 


145 


158.5 


5.7 


471 


1 


S08325 


flavonol 3-O-gluco 


146 


158 


5.7 


496 


2 


E84784 


probable glucosyl 


147 


157.5 


5.7 


435 


2 


H86924 


probable glycosyl 


148 


157.5 


5.7 


467 


2 


F84699 


probable flavonol 


149 


157.5 


5.7 


471 


2 


S01037 


flavonol 3-O-gluco 


150 


157 


5.7 


317 


2 


T31897 


hypothetical prote 


151 


156.5 


5.7 


442 


2 


F84618 


probable flavonol 


152 


155.5 


5.6 


392 


1 


C69851 


macrolide glycosyl 


153 


155 


5.6 


470 


2 


D84614 


hypothetical prote 


154 


154 . 5 


5.6 


431 


2 


S52583 


crtX protein - Erw 


155 


154 


5.6 


495 


2 


H84784 


probable glucosyl 


156 


153. 5 


5.5 


487 


2 


S41951 


UTP-glucose glucos 


157 


153 


5.5 


451 


2 


T45604 


glucosyltransf eras 


158 


153 


5.5 


496 


2 


F84784 


probable glucosyl 


159 


152.5 


5.5 


495 


2 


G84784 


probable glucosyl 


160 


151.5 


5.5 


438 


2 


E86924 


probable glycosyl 


161 


151. 5 


5.5 


478 


2 


E84545 


probable glucosylt 


162 


151 


5.5 


453 


2 


D86430 


probable UDP-gulco 


163 


149.5 


5.4 


449 


2 


G70670 


hypothetical prote 


164 


149.5 


5.4 


466 


2 


T07404 


probable glucosylt 


165 


149.5 


5.4 


489 


2 


H86356 


probable UDP-gluco 


166 


149 


5.4 


507 


2 


T46161 


glucosyl trans f eras 


167 


148.5 


5.4 


347 


2 


T06371 


probable UDP-glucu 


168 


148.5 


5.4 


438 


2 


T45602 


glucosyltransf eras 


169 


147.5 


5.3 


442 


2 


H85096 


hypothetical prote 


170 


147.5 


5.3 


479 


2 


F86356 


T16E15.2 protein - 


171 


147.5 


5.3 


481 


2 


B84700 


probable flavonol 


172 


146.5 


5.3 


452 


2 


T12981 


hypothetical prote 


173 


146.5 


5.3 


458.. 


.2 


C71420 


hypothetical prote 


174 


146.5 


5 . 3 


476 


2 


T03745 


glucosyltransf eras 


175 


146.5 


5.3 


496 


2 


D84784 


probable glucosyl 


176 


146 


5.3 


452 


2 


G71416 


probable glucosylt 


177 


146 


5.3 


491 


2 


C84784 


probable glucosyl 


178 


145.5 


5.3 


481 


2 


G86144 


hypothetical prote 


179 


145 


5.2 


453 


2 


T00511 


indole-3-acetate b 


180 


144 


5.2 


407 


2 


C97234 


probable glycosyl 


181 


143.5 


5.2 


479 


2 


G86207 


hypothetical prote 


182 


143 


5.2 


469 


2 


D86144 


protein probable U 


183 


143 


5.2 


481 


2 


E86356 


hypothetical prote 


184 


142.5 


5.1 


472 


2 


S39507 


glucuronosyl trans 



185 


142 


5.1 


424 


2 


T51110 


glycosyltransf eras 


186 


- 141.5 


5.1 


449 


2 


T45605 


glucosyltrans f eras 


187 


141.5 


5.1 


476 


2 


T03747 


glucosyltransf eras 


188 


141 


5. 1 


394 


2 


S41952 


UTP-glucose glucos 


189 


141 


5.1 


481 


2 


T01850 


UTP-glucose glucos 


190 


140.5 


5 . 1 


346 


2 


S41954 


UTP-glucose glucos 


191 


140.5 


5.1 


486 


2 


T51431 


glucosyltransf eras 


192 


140 


5.1 


480 


2 


F86207 


hypothetical prote 


193 


139.5 


5.0 


452 


2 


F96672 


Similar to Flavono 


194 


138 . 5 


5.0 


444 


2 


G84565 


probable flavonol 


195 


138 . 5 


5 . 0 


449 


2 


S41950 


UTP-glucose glucos 


196 


138 . 5 


5.0 


450 


2 


D86356 


hypothetical prote 


197 


138 . 5 


5.0 


460 


2 


T47709 


glucuronosyl trans 


198 


138 


5 . 0 


385 


2 


T12980 


hypothetical prote 


199 


138 


5 . 0 


438 


2 


T00507 


indole-3-acetate b 


200 


138 


5 . 0 


455 


2 


T00584 


indole-3-acetate b 


201 


137 . 5 


5.0 


467 


2 


T02238 


glucosyl transfera 


202 


136 . 5 


4 . 9 


488 


2 


T07786 


UDP-glucose glucos 


203 


135 


4 . 9 


470 


2 


H84565 


probable flavonol 


C. \J *± 


135 


4 . 9 


490 


2 


T46162 


glucosyltrans f eras 


205 


134 


4 . 8 


455 


2 


T05861 


hypothetical prote 


206 


134 


4 . 8 


456 


2 


C71402 


probable glucosylt 


207 


133 . 5 


4 . 8 


398 


2 


A97146 


probable glycosylt 


208 


132 . 5 


4 . 8 


447 


2 


T12978 


hypothetical prote 


209 


132 . 5 


4 . 8 


447 


2 


E96672 


Similar to Flavono 


210 


131 . 5 


4 . 8 


287 


2 


H69906 


macrolide glycosyl 


211 


130 . 5 


4 . 7 


170 


2 


G86356 


hypothetical prote 


21% 


130 


4 . 7 


455 


1 


XUBHFG 


flavonol 3-O-gluco 


213 


129 . 5 


4 . 7 


400 


2 


C97147 


probable glycosylt 


214 


128 . 5 


4 . 6 


398 


2 


F64456 


hypothetical prote 


215 


128 . 5 


A . 6 


430 


1 


S33184 


glycosyltransf eras 


216 


126 


4 . 6 


440 


2 


T00583 


probable indole-3- 


217 


125.5 


4 . 5 


381 


1 


A69784 


macrolide glycosyl 


218 


120 . 5 


4 . 4 


490 


2 


H84786 


probable glucosylt 


219 


120 


4 . 3 


418 


2 


F75587 


probable glycosylt 


220 


118 . 5 


4 . 3 


418 


2 


JS0636 


glycosyl transfera 


221 


118 . 5 


4 . 3 


453 


2 


T05862 


hypothetical prote 


222 


115 . 5 


4 .2 


433 


2 


H96549 


hypothetical prote 


223 


115 


4 . 2 


615 


2 


D96499 


probable UDP-gluco 


224 


114 


4 . 1 


1274 


2 


JN0015 


trp protein - frui 


225 


113. 5 


4 .1 


460 


2 


F84529 


probable glucosylt 


226 


113.5 


4 . 1 


473 


2 


S60290 


anthocyanin rhamno 


227 


113 


4 . 1 


478 


2 


T05423 


probable glucosylt 


£. C. \J 


112 . 5 


4 . 1 


281 


2 


T29825 


hypothetical prote 


229 


112 . 5 


4 . 1 


732 


2 


T14233 


NADH2 dehydrogenas 


230 


111. 5 


4 . 0 


374 


2 


-C85609 


probable glucosylt 


231 


111 . 5 


4 . 0 


374 


2 


B90800 


probable glucosyl- 


232 


111.5 


4.0 


465 


2 


T48374 


UDPG glucosyltrans 


233 


111 . 5 


4 . 0 


471 


2 


S36655 


UDP rhamnose-antho 


234 


111 


4 . 0 


842 


2 


S56819 


hypothetical prote 


235 


110.5 


4 . 0 


445 


2 


S75596 


hypothetical prote 


236 


109. 5 


4.0 


598 


2 


C97273 


mutS-like mismatch 


237 


109.5 


4.0 


1221 


2 


T25005 


hypothetical prote 


238 


109 


3.9 


408 


2 


T44859 


glycosyltransf eras 


239 


108.5 


3.9 


1232 


2 


D64413 


cobalamin biosynth 


240 


108 


3.9 


3079 


1 


RGBYI2 


probable GTPase-ac 


241 


107 


3.9 


427 


2 


G81260 


UDP-N-acetylmuramo 



242 


106 


3.8 


388 


2 


B70878 


probable transfera 


243 


106 


3.8 . 


482 


2 


E84680 


„ probable glucosylt 


244 


105.5 


3.8 


484 


2 


E84529 


probable glucosylt 


245 


105.5 


3.8 


556 


2 


S51858 


probable membrane 


246 


105.5 


3.8 


570 


2 


D96766 


protein glucosyltr 


247 


105.5 


3.8 


732 


2 


T13814 


NADH2 dehydrogenas 


248 


105 


3.8 


408 


2 


T44860 


glycosyltrans f eras 


249 


103.5 


3.7 


371 


2 


AC0837 


probable glycosylt 


250 


103. 5 


3.7 


371 


2 


T30292 


glucosyl- transfera 


251 


103. 5 


3.7 


448 


2 


C96542 


hypothetical prote 


252 


103. 5 


3.7 


469 


1 


RGKBCP 


nitrogen regulatio 


253 


103.5 


3.7 


622 


2 


A45746 


phosphoenolpyruvat 


254 


103 


3.7 


395 


2 


AB0242 


probable glycosyl 


255 


103 


3.7 


508 


2 


S12640 


transposition prot 


256 


103 


3.7 


967 


2 


A30325 


membrane alanyl am 


257 


102.5 


3.7 


435 


2 


A86195 


hypothetical prote 


258 


102.5 


3.7 


663 


2 


F90291 


hypothetical prote 


259 


102.5 


3.7 


706 


2 


D90124 


hypothetical prote 


260 


102 . 5 


3.7 


1275 


2 


JU0092 


trp protein - frui 


261 


102 


3.7 


334 


2 


H75362 


hypothetical prote 


262 


102 


3.7 


1655 


2 


S47446 


nucleoporin Npl88 


263 


101.5 


3.7 


385 


2 


C71699 


murg protein (murG 


264 


101.5 


3.7 


728 


2 


T13794 


NADH2 dehydrogenas 


265 


101 


3.6 


511 


2 


S43685 


monoamine transpor 


266 


101 


3.6 


693 


2 


T13175 


NADH2 dehydrogenas 


267 


101 


3.6 


697 


2 


T13754 


NADH2 dehydrogenas 


268 


100.5 


3.6 


370 


2 


E96929 


probable permease 


269 


100.5 


3.6 


740 


2 


T12753 


NADH2 dehydrogenas 


270 


100 . 5 


3. 6 


858 


2 


A44919 


GCR3 protein - yea 


271 


100 


3.6 


1058 


2 


D82654 


ankyrin-like prote 


272 


99. 5 


3.6 


396 


2 


B69325 


LPS biosynthesis p 


273 


99 


3.6 


190 


2 


T24652 


hypothetical prote 


274 


99 


3.6 


686 


2 


T13561 


NADH2 dehydrogenas 


275 


99 


3.6 


738 


2 


T12192 


NADH2 dehydrogenas 


276 


98 . 5 


3. 6 


733 


2 


T12220 


NADH2 dehydrogenas 


277 


98.5 


3.6 


942 


2 


S75598 


poly (A) polymerase 


278 


98 . 5 


3. 6 


1049 


2 


S67613 


vacuolar carboxype 


279 


98 


3.5 


382 


2 


T46519 


probable glycosyl 


280 


98 


3.5 


622 


1 


QYCHGC 


phosphoenolpyruvat 


281 


98 


3.5 


1271 


2 


T43269 


microcystin synthe 


282 


97 . 5 


3.5 


387 


2 


A81324 


cyclopropane- fatty 


283 


97 . 5 


3.5 


514 


1 


A44405 


tyrosine decarboxy 


284 


97 . 5 


3.5 


774 


2 


S25284 


protein kinase nek 


285 


97 . 5 


3.5 


1830 


2 


E82909 


conserved hypothet 


286 


97 


3.5 


392 


2 


S72936 


UDP-glucuronosyltr 


287 


97 


3.5 


664 


2 


A72215 


excinuclease ABC c 


288 


96.5 


3.5 


358 


2 


D95041 


hypothetical prote 


289 


96.5 


3.5 


383 


2 


AG2894 


conserved hypothet 


290 


96.5 


3.5 


388 


2 


B97670 


hypothetical prote 


291 


96.5 


3.5 


521 


2 


D75581 


malate synthase - 


292 


96.5 


3.5 


612 


2 


G83307 


hypothetical prote 


293 


96 


3.5 


336 


2 


C71154 


hypothetical prote 


294 


96 


3.5 


544 


2 


S54531 


hypothetical prote 


295 


95.5 


3.5 


469 


2 


S53024 


nitrogen regulatio 


296 


95.5 


3.5 


505 


2 


T00400 


hypothetical prote 


297 


95.5 


3.5 


698 


2 


T12586 


NADH2 dehydrogenas 


298 


95.5 


3.5 


741 


2 


T12707 


NADH2 dehydrogenas 



299 


95 


3.4 


204 


2 


E85660 


hypothetical prote 


300 


95 


3.4 


470 


2 


AG0003 


nitrogen regulatio 


301 


95 


3.4 


535 


2 


E86334 


hypothetical prote 


302 


95 


3.4 


707 


2 


T12665 


NADH2 dehydrogenas 


303 


95 


3.4 


707 


2 


T12751 


NADH2 dehydrogenas 


304 


95 


3.4 


707 


2 


T12752 


NADH2 dehydrogenas 


305 


94.5 


3.4 


282 


2 


AF2432 


hypothetical prote 


306 


94.5 


3.4 


333 


2 


E97257 


spore coat protein 


307 


94.5 


3.4 


427 


2 


G72278 


hypothetical prote 


308 


94. 5 


3.4 


705 


2 


T34477 


hypothetical prote 


309 


94.5 


3.4 


705 


2 


A86497 


conserved hypothet 


310 


94. 5 


3.4 


705 


2 


F72125 


conserved hypothet 


311 


94.5 


3.4 


741 


2 


T13791 


NADH2 dehydrogenas 


312 


94 


3.4 


296 


2 


D48327 


COI intron i3 prot 


313 


94 


3.4 


531 


2 


F70415 


CTP synthetase - A 


314 


94 


3.4 


707 


2 


T12759 


NADH2 dehydrogenas 


315 


94 


3.4 


707 


2 


T12658 


NADH2 dehydrogenas 


316 


94 


3.4 


707 


2 


T12664 


NADH2 dehydrogenas 


317 


93.5 


3.4 


411 


2 


T46681 


probable glycosylt 


318 


93.5 


3.4 


681 


2 


T39814 


hypothetical prote 


319 


93.5 


3.4 


731 


2 


T14231 


NADH2 dehydrogenas 


320 


93.5 


3.4 


734 


2 


T13685 


NADH2 dehydrogenas 


321 


93. 5 


3.4 


763 


2 


H70145 


ATP-dependent Clp 


322 


93. 5 


3.4 


1032 


2 


D95177 


Snf2 family protei 


323 


93. 5 


3.4 


1169 


2 


H70178 


exodeoxyribonuclea 


324 


93. 5 


3.4 


1203 


2 


T28895 


hypothetical prote 


325 


93. 5 


3.4 


1826 


2 


H86502 


excinuclease ABC s 


326 


93. 5 


3.4 


1826 


2 


D72120 


excinuclease ABC, 


327 


93 


3.4 


700 


2 


T12589 


NADH2 dehydrogenas 


328 


93 


3 . 4 


707 


2 


T12668 


NADH2 dehydrogenas 


329 


93 


3.4 


707 


2 


T12671 


NADH2 dehydrogenas 


330 


93 


3.4 


707 


2 


T13036 


NADH2 dehydrogenas 


331 


93 


3.4 


707 


2 


T13032 


NADH2 dehydrogenas 


332 


93 


3.4 


737 


2 


T14232 


NADH2 dehydrogenas 


333 


92 . 5 


3.3 


460 


2 


B48057 


f arnesyl-diphospha 


334 


92 . 5 


3.3 


460 


2 


T40581 


fames yl-diphospha 


335 


92 . 5 


3 . 3 


469 


2 


AC0950 


Two- component syst 


336 


92.5 


3.3 


562 


2 


T34319 


hypothetical prote 


337 


92.5 


3.3 


622 


1 


QYRTGP 


phosphoenolpyruvat 


338 


92. 5 


3.3 


727 


2 


T12221 


NADH2 dehydrogenas 


339 


92. 5 


3.3 


828 


2 


JC5807 


trp3 protein - rat 


340 


92 


3.3 


402 


2 


AE2894 


conserved hypothet 


341 


92 


3.3 


405 


2 


H97669 


hypothetical prote 


342 


92 


3.3 


468 


1 


RGECGG 


nitrogen regulatio 


343 


92 


3.3 


514 


1 


ALBSN 


alpha-amylase (EC 


344 


92 


3.3 


586 


2 


JC5618 


beta-galactosidase 


345 


92 


3.3 


697 


2 


T13662 


NADH2 dehydrogenas 


346 


92 


3.3 


1442 


2 


C82898 


DNA polymerase III 


347 


91. 5 


3.3 


550 


2 


B82330 


glucose- 6-phosphat 


348 


91.5 


3.3 


593 


2 


S17433 


auxin-regulated pr 


349 


91.5 


3.3 


602 


2 


S69198 


prostaglandin G/H 


350 


91.5 


3.3 


645 


2 


T12159 


NADH2 dehydrogenas 


351 


91.5 


3.3 


661 


2 


T13761 


NADH2 dehydrogenas 


352 


91.5 


3.3 


696 


2 


T12160 


NADH2 dehydrogenas 


353 


91.5 


3.3 


732 


2 


T12194 


NADH2 dehydrogenas 


354 


91.5 


3.3 


737 


2 


T12193 


NADH2 dehydrogenas 


355 


91.5 


3.3 


744 


2 


T13485 


NADH2 dehydrogenas 



356 


91.5 


3.3 


1032 


2 


F98043 


SWF/SNF family ATP 


357 


91.5 


3.3 


. 1121 


2 


C82120 


transcription r.egu 


358 


91.5 


3.3 


1199 


2 


G69698 


RNA polymerase (be 


359 


91 


3.3 


364 


2 


D95116 


f errochelatase [im 


360 


91 


3.3 


364 


2 


B97986 


f errochelatase (EC 


361 


91 


3.3 


688 


2 


T13248 


NADH2 dehydrogenas 


362 


91 


3.3 


698 


2 


T12563 


NADH2 dehydrogenas 


363 


91 


3.3 


733 


2 


T12213 


NADH2 dehydrogenas 


364 


91 


3.3 


736 


2 


T12218 


NADH2 dehydrogenas 


365 


91 


3.3 


755 


2 


.T15791 


hypothetical prote 


366 


91 


3.3 


757 


2 


T34362 


hypothetical prote 


367 


90.5 


3.3 


346 


2 


G69325 


LPS glycosyltransf 


368 


90.5 


3.3 


381 


2 


F71196 


probable hexosyltr 


369 


90.5 


3.3 


602 


2 


A35564 


prostaglandin-endo 


370 


90.5 


3.3 


734 


2 


T12602 


NADH2 dehydrogenas 


371 


90.5 


3.3 


741 


2 


T12642 


NADH2 dehydrogenas 


372 


90.5 


3.3 


749 


2 


T13034 


NADH2 dehydrogenas 


373 


90.5 


3.3 


2783 


2 


T34416 


hypothetical prote 


374 


90 


3.3 


356 


2 


G72386 


conserved hypothet 


375 


90 


3.3 


479 


2 


C75099 


hypothetical prote 


376 


90 


3.3 


480 


2 


T34294 


hypothetical prote 


377 


90 


3.3 


557 


2 


H72400 


excinuclease ABC c 


378 


90 


3.3 


665 


2 


T13488 


NADH2 dehydrogenas 


379 


90 


3.3 


683 


2 


T12153 


NADH2 dehydrogenas 


380 


90 


3.3 


688 


2 


T13249 


NADH2 dehydrogenas 


381 


90 


3.3 


699 


2 


T12667 


NADH2 dehydrogenas 


382 


90 


3.3 


699 


2 


T12673 


NADH2 dehydrogenas 


383 


90 


3.3 


702 


2 


T13058 


NADH2 dehydrogenas 


384 


90 


3.3 


817 


2 


D86217 


protein T27G7.8 [i 


385 


90 


3.3 


1157 


2 


A55152 


PAS1 protein - yea 


386 


90 


3.3 


1953 


2 


S63244 


BNI1 protein - yea 


387 


89.5 


3.2 


274 


1 


D70378 


conserved hypothet 


388 


89.5 


3.2 


289 


2 


T34454 


hypothetical prote 


389 


89.5 


3.2 


422 


2 


C71930 


udp-n-acetylmuramo 


390 


89.5 


3.2 


468 


2 


F84686 


hypothetical prote 


391 


89.5 


3.2 


500 


2 


156601 


cytochrome P4 50 al 


392 


89.5 


3.2 


580 


2 


T31529 


hypothetical prote 


393 


89.5 


3.2 


657 


2 


E81119 


tail fibre protein 


394 


89.5 


3.2 


657 


2 


E81901 


probable phage tai 


395 


89.5 


3.2 


744 


2 


T12758 


NADH2 dehydrogenas 


396 


89.5 


3.2 


930 


2 


E98059 


isoleucine-tRNA li 


397 


89.5 


3.2 


3268 


2 


S69625 


hypothetical prote 


398 


89 


3.2 


276 


2 


B86710 


intercellular adhe 


399 


89 


3.2 


298 


2 


A75119 


hypothetical prote 


400 


89 


3.2 


377 


2 


T46149 


protein kinase ATN 


401 


.89 


3.2 


431 


2 


140898 


3-deox.y-manno-octu 


402 


89 


3.2 


686 


2 


T13234 


NADH2 dehydrogenas 


403 


89 


3.2 


686 


2 


T13252 


NADH2 dehydrogenas 


404 


89 


3.2 


686 


2 


T13768 


NADH2 dehydrogenas 


405 


89 


3.2 


686 


2 


T13490 


NADH2 dehydrogenas 


406 


89 


3.2 


686 


2 


T13766 


NADH2 dehydrogenas 


407 


89 


3.2 


688 


2 


T13282 


NADH2 dehydrogenas 


408 


89 


3.2 


688 


2 


T13278 


NADH2 dehydrogenas 


409 


89 


3.2 


688 


2 


T13242 


NADH2 dehydrogenas 


410 


89 


3.2 


688 


2 


T13373 


NADH2 dehydrogenas 


411 


89 


3.2 


688 


2 


T13243 


NADH2 dehydrogenas 


412 


89 


3.2 


688 


2 


T13253 


NADH2 dehydrogenas 



413 


89 


3.2 


688 


2 


T12670 


NADH2 dehydrogenas 


414 


89 


3.2 


696 


2 


T12659 


NADH2 dehydrogenas 


415 


89 


3.2 


990 


2 


D83706 


lantibiotic mersac 


416 


89 


3.2 


1058 


2 


T19282 


hypothetical prote 


417 


89 


3.2 


3147 


2 


T21328 


hypothetical prote 


418 


88.5 


3.2 


252 


2 


S50806 


hypothetical prote 


419 


88.5 


3.2 


396 


2 


A59226 


tauropine dehydrog 


420 


88.5 


3.2 


425 


2 


H90415 


hypothetical prote 


421 


88.5 


3.2 


500 


1 


DCJAAP 


aromatic-L-amino-a 


422 


88.5 


3.2 


647 


1 


QYFFGM 


phosphoenolpyruvat 


423 


88.5 


3.2 


721 


2 


T12212 


NADH2 dehydrogenas 


424 


88.5 


3.2 


737 


2 


T13504 


NADH2 dehydrogenas 


425 


88.5 


3.2 


741 


2 


T13361 


NADH2 dehydrogenas 


426 


88.5 


3.2 


741 


2 


T13774 


NADH2 dehydrogenas 


427 


88.5 


3.2 


899 


2 


H96617 


probable disease r 


428 


88.5 


3.2 


1435 


2 


D96693 


protein Putative A 


429 


88.5 


3.2 


1561 


1 


C69145 


protoporphyrin IX 


430 


88.5 


3.2 


1640 


2 


H88094 


protein F39E9.2 [i 


431 


88 


3.2 


405 


2 


D82542 


two- component syst 


432 


88 


3.2 


431 


2 


S41168 


3-deoxy-manno-octu 


433 


88 


3.2 


431 


2 


140897 


3-deoxy-manno-octu 


434 


88 


3.2 


628 


2 


S51422 


probable membrane 


435 


88 


3.2 


694 


2 


T12712 


NADH2 dehydrogenas 


436 


88 


3.2 


694 


2 


T12675 


NADH2 dehydrogenas 


437 


88 


3.2 


698 


2 


T12588 


NADH2 dehydrogenas 


438 


88 


3.2 


699 


2 


T17119 


NADH2 dehydrogenas 


439 


88 


3.2 


729 


2 


T12227 


NADH2 dehydrogenas 


440 


88 


3.2 


731 


2 


T12226 


NADH2 dehydrogenas 


441 


88 


3.2 


1244 


2 


C96584 


hypothetical prote 


442 


88 


3.2 


5138 


2 


B96695 


hypothetical prote 


443 


87.5 


3.2 


399 


2 


G70328 


chromosome replica 


444 


87.5 


3.2 


412 


2 


JC1452 


translation initia 


445 


87.5 


3.2 


432 


2 


A64602 


processing protein 


446 


87.5 


3.2 


447 


2 


B98052 


conserved hypothet 


447 


87.5 


3.2 


472 


2 


G81293 


probable type II p 


448 


87.5 


3.2 


509 


2 


T00793 


hypothetical prote 


449 


87.5 


3.2 


578 


2 


D89772 


hypothetical prote 


450 


87.5 


3.2 


635 


2 


T46407 


probable RNA helic 


451 


87.5 


3.2 


673 


2 


T20936 


hypothetical prote 


452 


87.5 


3.2 


744 


2 


T13040 


NADH2 dehydrogenas 


453 


87.5 


3.2 


749 


2 


JC7729 


replication protei 


454 


87.5 


3.2 


995 


2 


T27327 


hypothetical prote 


455 


87.5 


3.2 


1377 


2 


T19214 


UDP-glucose-glycop 


456 


87.5 


3.2 


1452 


1 


S17669 


protein-tyrosine-p 


457 


87.5 


3.2 


1597 


2 


S55144 


RLR1 protein - yea 


458 


87.5 


3.2 


18.3.0 


1 


S19188 


myosin-V - chicken 


459 


87 


3.1 


141 


2 


T10306 . 


hypothetical prote 


460 


87 


3.1 


214 


2 


T49174 


hypothetical prote 


461 


87 


3.1 


333 


2 


AF2124 


hypothetical prote 


462 


87 


3.1 


412. 


2 


T47321 


hypothetical prote 


463 


87 


3.1 


430 


2 


F81698 


3-deoxy-manno-octu 


464 


87 


3.1 


431 


2 


140894 


3-deoxy-manno-octu 


465 


87 


3.1 


500 


2 


T33279 


hypothetical prote 


466 


87 


3.1 


511 


2 


S43686 


monoamine transpor 


467 


87 


3.1 


512 


2 


C64599 


hypothetical prote 


468 


87 


3.1 


551 


2 


T50663 


RCCl-like G exchan 


469 


87 


3.1 


661 


2 


F90360 


hypothetical prote 



470 


87 


3.1 


684 


2 


T12151 


NADH2 dehydrogenas 


471 


87 


3.1 


694 


2 


T13573 


NADH2 dehydrogenas 


472 


87 


3.1 


698 


2 


T12562 


NADH2 dehydrogenas 


473 


87 


3.1 


734 


1 


DERZN5 


NADH2 dehydrogenas 


474 


87 


3.1 


736 


2 


T12217 


NADH2 dehydrogenas 


475 


87 


3.1 


839 


2 


S73548 


MG422 homolog C12_ 


476 


87 


3.1 


1037 


2 


T18365 


polys ialyl trans f er 


477 


87 


3.1 


1037 


2 


T18368 


polysialyl trans f er 


478 


87 


3.1 


1167 


2 


F71909 


hypothetical prote 


479 


87 


3.1 


1411 


2 


S55123 


hypothetical prote 


480 


87 


3.1 


1581 


2 


T30832 


pentaf unctional en 


481 


87 


3.1 


1828 


2 


B59254 


mysoin heavy chain 


482 


87 


3.1 


1853 


1 


A46761 


myosin heavy chain 


483 


87 


3.1 


1855 


2 


A59254 


mysoin heavy chain 


484 


87 


3.1 


4717 


2 


T41581 


hypothetical coile 


485 


86.5 


3.1 


316 


2 


C96020 


probable sugar upt 


486 


86.5 


3.1 


358 


2 


AB2041 


hypothetical prote 


487 


86.5 


3.1 


359 


2 


A97175 


undecaprenyl-PP-Mu 


488 


86.5 


3.1 


376 


2 


AG2227 


DNA repair' and gen 


489 


86.5 


3.1 


409 


2 


F71956 


hypothetical prote 


490 


86.5 


3.1 


413 


2 


D72279 


conserved hypothet 


491 


86.5 


3.1 


417 


2 


F96607 


probable clathrin- 


492 


86.5 


3.1 


469 


2 


E86074 


hypothetical prote 


493 


86.5 


3.1 


469 


2 


F91227 


response regulator 


494 


86.5 


3.1 


495 


2 


T46700 


hypothetical prote 


495 


86.5 


3.1 


563 


2 


T44214 


probable phosphotr 


496 


86.5 


3.1 


617 


2 


S37744 


endo-exonuclease y 


497 


86.5 


3.1 


622 


2 


JC7973 


synleurin - human 


498 


86.5 


3.1 


682 


2 


T12715 


NADH2 dehydrogenas 


499 


86.5 


3.1 


733 


2 


T22011 


hypothetical prote 


500 


86.5 


3.1 


741 


2 


T12691 


NADH2 dehydrogenas 


501 


86.5 


3.1 


741 


2 


T12762 


NADH2 dehydrogenas 


502 


86.5 


3.1 


741 


2 


T13372 


NADH2 dehydrogenas 


503 


86.5 


3.1 


741 


2 


T13509 


NADH2 dehydrogenas 


504 


86.5 


3.1 


741 


2 


T13765 


NADH2 dehydrogenas 


505 


86.5 


3.1 


744' 


2 


T12627 


NADH2 dehydrogenas 


506 


86.5 


3.1 


744 


2 


T13585 


NADH2 dehydrogenas 


507 


86.5 


3.1 


747 


2 


T13683 


NADH2 dehydrogenas 


508 


86.5 


3.1 


892 


2 


AE0650 


alcohol dehydrogen 


509 


86.5 


3.1 


1076 


2 


A69409 


carbamoyl-phosphat 


510 


86.5 


3.1 


1575 


2 


S68448 


synapto janin, 17 OK 


511 


86.5 


3.1 


1702 


2 


S42459 


DNA-directed DNA p 


512 


86 


3.1 


342 


2 


T48045 


hypothetical prote 


513 


86 


3.1 


410 


2 


A64234 


ATP-dependent RNA 


514 


86 


3.1 


649 


1 


TWPMK 


large T antigen - 


515 


86 


3.1 


684 


2 


T13491 


NADH2 dehydrogenas 


516 


86 


3.1 


700 


2 


T13763 


NADH2 dehydrogenas 


517 


86 


3.1 


703 


2 


T13074 


NADH2 dehydrogenas 


518 


86 


3.1 


706 


2 


A48084 


STE11 protein kina 


519 


86 


3.1 


713 


2 


JC2534 


RVLG protein - rat 


520 


86 


3.1 


746 


2 


S08494 


NADH2 dehydrogenas 


521 


86 


3.1 


881 


2 


S67026 


probable membrane 


522 


86 


3.1 


891 


1 


DEEC 


acetaldehyde dehyd 


523 


86 


3.1 


891 


2 


D85704 


hypothetical prote 


524 


86 


3.1 


891 


2 


E90846 


acetaldehyde dehyd 


525 


86 


3.1 


1025 


2 


H81751 


exodeoxyribonuclea 


526 


86 


3.1 


3194 


2 


D71917 


toxin-like outer m 



527 


86 


3.1 


5175 


2 


T20992 


hypothetical prote 


528 


86 


3.1 - 


5198 


2 


T43290 


hemicentin precurs 


529 


85.5 


3.1 


332 


2 


T32852 


hypothetical prote 


530 


85.5 


3.1 


338 


2 


G97771 


capM protein [impo 


531 


85.5 


3.1 


352 


2 


D97947 


UDP-N-acetylglucos 


532 


85.5 


3.1 


356 


2 


H81800 


conserved hypothet 


533 


85.5 


3.1 


356 


2 


F81068 


conserved hypothet 


534 


85.5 


3.1 


406 


2 


C72340 


probable hexosyltr 


535 


85.5 


3.1 


467 


2 


AG0546 


probable terminal 


536 


85.5 


3.1 


469 


2 


B70486 


hypothetical prote 


537 


85.5 


3.1 


491 


2 


C86781 


alpha-amylase [imp 


538 


85.5 


3.1 


492 


2 


S32491 


testosterone 7alph 


539 


85.5 


3.1 


494 


2 


B89870 


hypothetical prote 


540 


85.5 


3.1 


506 


2 


F83545 


hypothetical prote 


541 


85.5 


3.1 


546 


2 


D89813 


alpha-glucosidase 


542 


85.5 


3.1 


557 


2 


E83744 


hypothetical prote 


543 


85.5 


3.1 


677 


2 


F64139 


guanosine-3 1 r 5 1 -bi 


544 


85.5 


3.1 


681 


2 


T13566 


NADH2 dehydrogenas 


545 


85.5 


3.1 


702 


2 


T13708 


NADH2 dehydrogenas 


546 


85.5 


3.1 


731 


2 


T16524 


hypothetical prote 


547 


85.5 


3.1 


740 


1 


DENTN5 


NADH2 dehydrogenas 


548 


85.5 


3.1 


741 


2 


T12711 


NADH2 dehydrogenas 


549 


85.5 


3.1 


741 


2 


T13054 


NADH2 dehydrogenas 


550 


85.5 


3.1 


741 


2 


T13508 


NADH2 dehydrogenas 


551 


85.5 


3.1 


741 


2 


T13776 


NADH2 dehydrogenas 


552 


85.5 


3.1 


741 


2 


T12692 


NADH2 dehydrogenas 


553 


85.5 


3.1 


741 


2 


T12706 


NADH2 dehydrogenas 


554 


85.5 


3.1 


741 


2 


T13042 


NADH2 dehydrogenas 


555 


85.5 


3.1 


741 


2 


T13247 


NADH2 dehydrogenas 


556 


85.5 


3.1 


741 


2 


T13398 


NADH2 dehydrogenas 


557 


85.5 


3.1 


743 


2 


T12626 


NADH2 dehydrogenas 


558 


85.5 


3.1 


743 


2 


T13238 


NADH2 dehydrogenas 


559 


85.5 


3.1 


744 


2 


T12603 


NADH2 dehydrogenas 


560 


85.5 


3.1 


744 


2 


T13755 


NADH2 dehydrogenas 


561 


85.5 


3.1 


749 


2 


T12623 


NADH2 dehydrogenas 


562 


85.5 


3.1 


891 


2 


AG0265 


alcohol dehydrogen 


563 


85.5 


3.1 


1173 


2 


T42719 


TPR-containing/SH2 


564 


85 


3.1 


180 


2 


AF1175 


transcription regu 


565 


85 


3.1 


185 


2 


A82569 


hypoxanthine-guani 


566 


85 


3.1 


217 


2 


H90143 


hypothetical prote 


567 


85 


3.1 


283 


2 


T27423 


hypothetical prote 


568 


85 


3.1 


418 


2 


G75496 


UDP-N-acetylglucos 


569 


85 


3.1 


458 


2 


F84746 


probable serine ca 


570 


85 


3.1 


542 


2 


JN0438 


carboxylesterase ( 


571 


85 


3.1 


584 


2 


T14631 


glucose- 6-phosphat 


572 


85 


3.1 


614 


2 


T29937 


hypothetical prote 


573 


85 


3.1 


656 


2 


A96724 


hypothetical prote 


574 


85 


3.1 


696 


2 


T12663 


NADH2 dehydrogenas 


575 


85 


3.1 


698 


2 


T12560 


NADH2 dehydrogenas 


576 


85 


3.1 


699 


2 


T12163 


NADH2 dehydrogenas 


.577 


85 


3.1 


699 


2 


T12169 


NADH2 dehydrogenas 


578 


85 


3.1 


699 


2 


T12167 


NADH2 dehydrogenas 


579 


85 


3.1 


699 


2 


T12172 


NADH2 dehydrogenas 


580 


85 


3.1 


699 


2 


T12164 


NADH2 dehydrogenas 


581 


85 


3.1 


699 


2 


T12168 


NADH2 dehydrogenas 


582 


85 


3.1 


700 


2 


T13668 


NADH2 dehydrogenas 


583 


85 


3.1 


700 


2 


T13702 


NADH2 dehydrogenas 



584 


85 


3.1 


701 


2 


T13056 


NADH2 dehydrogenas 


585 


85 


3.1 


705 


2 


T13494 


NADH2 dehydrogenas 


586 


85 


3.1 


721 


2 


T14229 


NADH2 dehydrogenas 


587 


85 


3.1 


736 


2 


T12225 


NADH2 dehydrogenas 


588 


85 


3.1 


752 


2 


H86770 


hypothetical prote 


589 


85 


3.1 


756 


2 


T05829 


hypothetical prote 


590 


85 


3.1 


779 


2 


B81287 


hypothetical prote 


591 


85 


3.1 


824 


2 


B38423 


protein-glut amine 


592 


85 


3.1 


893 


2 


E72232 


DNA-directed DNA p 


593 


85 


3.1 


988 


2 


E96621 


hypothetical prote 


594 


85 


3.1 


1039 


2 


T28644 


Y4jQ protein - Rhi 


595 


85 


3. 1 


1045 


2 


E90705 


hypothetical prote 


596 


85 


3. 1 


1045 


2 


H85555 


hypothetical prote 


597 


85 


3. 1 


1058 


2 


S08436 


pol polyprotein - 


598 


85 


3. 1 


1238 


2 


T15824 


hypothetical prote 


599 


85 


3.1 


1721 


1 


138902 


retinoblastoma bin 


600 


84 . 5 


3.1 


221 


2 


D70243 


hypothetical prote 


601 


84 . 5 


3. 1 


352 


2 


A95080 


hypothetical prote 


602 


84.5 


3 . 1 


409 


2 


C64553 


chlorohydrolase - 


603 


84 . 5 


3. 1 


428 


2 


A30108 


cyclin B - Atlanti 


604 


84 . 5 


3.1 


447 


2 


B95185 


Mur ligase family 


605 


84 . 5 


3 . 1 


471 


1 


A44872 


6-phosphof ructo-2- 


606 


84 . 5 


3. 1 


471 


1 


S12732 


6-phosphof ructo-2- 


607 


84.5 


3 . 1 


558 


2 


C88996 


protein C17B7.8 [i 


608 


84 . 5 


3. 1 


563 


2 


T44029 


ganciclovir kinase 


609 


84 . 5 


3 . 1 


580 


2 


C81352 


lipid export ABC t 


610 


84 . 5 


3 . 1 


738 


2 


S58612 


NADH2 dehydrogenas 


611 


84 . 5 


3. 1 


741 


2 


T13760 


NADH2 dehydrogenas 


612 


84.5 


3 . 1 


741 


2 


T13658 


NADH2 dehydrogenas 


613 


84 . 5 


3 . 1 


741 


2 


T13677 


NADH2 dehydrogenas 


614 


84 . 5 


3.1 


742 


2 


T13245 


NADH2 dehydrogenas 


615 


84 . 5 


3 . 1 


742 


2 


T13369 


NADH2 dehydrogenas 


616 


84 . 5 


3 . 1 


743 


2 


T12690 


NADH2 dehydrogenas 


617 


84 . 5 


3. 1 


744 


2 


T13063 


NADH2 dehydrogenas 


618 


84 ; 5 


3.1 


744 


2 


T12694 


NADH2 dehydrogenas 


619 


84 . 5 


3 . 1 


744 


2 


T12611 


NADH2 dehydrogenas 


620 


84 . 5 


3. 1 


744 


2 


T12761 


NADH2 dehydrogenas 


621 


84 . 5 


3. 1 


745 


2 


T12189 


NADH2 dehydrogenas 


622 


84.5 


3.1 


822 


2 


B97839 


ATP-dependent heli 


623 


84 . 5 


3.1 


850 


2 


A84685 


probable RNA methy 


624 


84 . 5 


3 . 1 


940 


2 


B81852 


probable type III 


625 


84 . 5 


3 . 1 


941 


2 


T49136 . 


protein kinase-lik 


626 


84 . 5 


3.1 


1452 


1 


S17670 


protein-tyrosine-p 


627 


84 . 5 


3.1 


1939 


2 


T18372 


repeat organellar 


628 


84 . 5 


3. 1 


3071 


2 


T50345 


vacuolar protein s 


629 


84 


3.0 


292 


2 


H70313 


cobalamin synthesi 


630 


84 


3.0 


345 


2 


B83260 


probable transcrip 


631 


84 


3.0 


446 


2 


S34570 


beta-glucosidase ( 


632 


84 


3. 0 


504 


2 


AC1740 


S. pyogenes RofA r 


633 


84 


3. 0 


525 


2 


T25550 


hypothetical prote 


634 


84 


3.0 


600 


2 


T24626 


hypothetical prote 


635 


84 


3.0 


621 


2 


T47843 


dynamin-like prote 


636 


84 


3.0 


686 


2 


T13680 


NADH2 dehydrogenas 


637 


84 


3.0 


686 


2 


T13235 


NADH2 dehydrogenas 


638 


84. 


3.0 


688 


2 


T13237 


NADH2 dehydrogenas 


639 


84 


3.0 


689 


2 


T13771 


NADH2 dehydrogenas 


640 


84 


3.0 


692 


2 


T12587 


NADH2 dehydrogenas 



641 


84 


3.0 


696 


2 


T13767 


NADH2 dehydrogenas 


642 


84 


3.0 


698 


2 


T12556, 


NADH2 dehydrogenas 


643 


84 


3.0 


741 


2 


T13086 


NADH2 dehydrogenas 


644 


84 


3.0 


778 


2 


D72421 


xylosidase - Therm 


645 


84 


3.0 


805 


2 


S07099 


membrane alanyl am 


646 


84 


3.0 


950 


2 


A71655 


hypothetical prote 


647 


84 


3.0 


1122 


2 


S64443 


probable membrane 


64 8 


84 


3.0 


1171 


2 


G97174 


pyruvate : f erredoxi 


649 


84 


3.0 


1219 


2 


H84464 


probable helicase 


650 


84 


3.0 


1254 


2 


S46636 


hypothetical prote 


651 


84 


3.0 


1311 


2 


T08986 


hypothetical prote 


652 


84 


3.0 


1630 


2 


S64403 


ESP1 protein - yea 


653 


84 


3.0 


1683 


2 


S56811 


probable membrane 


654 


84 


3.0 


1966 


2 


T08991 


hypothetical prote 


655 


83. 5 


3.0 


261 


2 


F71693 


probable enoyl-[ac 


656 


83.5 


3.0 


275 


2 


C81700 


3-deoxy-manno-octu 


657 


83. 5 


3.0 


305 


2 


D97169 


uncharacterized pr 


658 


83.5 


3.0 


365 


2 


S54049 


hypothetical prote 


659 


83.5 


3.0 


393 


2 


G83749 


methionine gamma 1 


660 


83.5 


3.0 


395 


2 


D89986 


hypothetical prote 


661 


83.5 


3. 0 


411 


2 


G75150 


3-hydroxy-3-methyl 


662 


83.5 


3.0 


429 


2 


B97500 


ribonuclease D (PA 


663 


83. 5 


3.0 


544 


2 


AI3266 


integral membrane 


664 


83. 5 


3. 0 


554 


1 


ZABPG4 


replication initia 


665 


83. 5 


3 . 0 


574 


2 


S45754 


probable membrane 


666 


83. 5 


3.0 


585 


2 


G86200 


protein F12K11.15 


667 


83. 5 


3.0 


602 


2 


S71557 


pyruvate decarboxy 


668 


83.5 


3.0 


685 


2 


T12145 


NADH2 dehydrogenas 


669 


83.5 


3.0 


697 


2 


T13370 


NADH2 dehydrogenas 


670 


83.5 


3.0- 


702 


2 


T13409 


NADH2 dehydrogenas 


671 


83.5 


3 . 0 


703 


2 


T13696 < 


NADH2 dehydrogenas 


672 


83.5 


3.0 


717 


2 


S23098 


guanylate cyclase 


673 


83.5 


3. 0 


738 


2 


T13406 


NADH2 dehydrogenas 


674 


83. 5 


3.0 


741 


2 


T13251 


NADH2 dehydrogenas 


675 


83. 5 


3.0 


741 


2 


T13697 


NADH2 dehydrogenas 


67 6 


83. 5 


3.0 


741 


2 


T13377 


NADH2 dehydrogenas 


677 


83. 5 


3. 0 


741 


2 


T13706 


NADH2 dehydrogenas 


678 


83.5 


3.0 


743 


2 


T12760 


NADH2 dehydrogenas 


679 


83 . 5 


3. 0 


744 


2 


T13043 


NADH2 dehydrogenas 


680 


83.5 


3.0 


744 


2 


T13682 


NADH2 dehydrogenas 


681 


83.5 


3.0 


930 


2 


A95193 


isoleucyl-tRNA syn 


682 


83. 5 


3.0 


1115 


2 


S57726 


PAT1 protein - sli 


683 


83.5 


3.0 


1127 


2 


T03105 


major single-stran 


684 


83.5 


3.0 


1133 


2 


JT0665 


helicase II-like p 


685 


83.5 


3.0 


1157 


2 


AD1728 


ATP-dependent deox 


686 


83.5 


3.0 


1181 


2 


T20386 


hypothetical prote 


687 


83.5 


3.0 


1502 


2 


S45429 


probable membrane 


688 


83.5 


3.0 


2265 


2 


T26183 


hypothetical prote 


689 


83 


3.0 


180 


2 


AD1532 


transcription regu 


690 


83 


3.0 


357 


2 


A65163 


lipopolysaccharide 


691 


83 


3.0 


431 


2 


F96764 


hypothetical prote 


692 


83 


3.0 


450 


2 


T39088 


probable vacuolar 


693 


83 


3.0 


499 


2 


T45946 


probable protein-k 


694 


83 


3.0 


516 


2 


T27092 


hypothetical prote 


695 


83 


3.0 


577 


2 


T43207 


nuclear fusion pro 


696 


83 


3.0 


590 


2 


T48625 


pescadillo-like pr 


697 


83 


3.0 


592 


2 


T48155 


pyruvate decarboxy 



698 


83 


3.0 


604 


2 


F89453 


699 


83 


3.0 


652 


2 


T38704 


700 


83 


3.0 


688 


2 


T09604 


701 


83 


3.0 


688 


2 


T09836 


702 


83 


3.0 


689 


2 


T13762 


703 


83 


3.0 


691 


2 


T12293 


704 


83 


3.0 


698 


2 


T12565 


705 


83 


3.0 


699 


2 


T12170 


706 


83 


3.0 


699 


2 


T12173 


707 


83 


3.0 


700 


2 


T12162 


708 


83 


3.0 


731 


2 


T31914 


709 


83 


3.0 


736 


2 


T12222 


710 


83 


3.0 


746 


2 


S71892 


711 


83 


3.0 


774 


2 


D97527 


712 


83 


3.0 


774 


2 


AE2746 


713 


83 


3.0 


864 


2 


D70183 


714 


83 


3.0 


893 


2 


F86476 


715 


83 


3.0 


1328 


2 


S78457 


716 


83 


3.0 


1357 


2 


S57052 


717 


83 


3.0 


4924 


2 


T50176 


718 


82.5 


3.0 


297 


2 


AC1494 


719 


82.5 


3.0 


335 


2 


A70128 


720 


82.5 


3.0 


356 


2 


B89919 


721 


82.5 


3.0 


385 


2 


S49111 


722 


82.5 


3.0 


426 


2 


C83212 


723 


82.5 


3.0 


426 


2 


B53652 


724 


82.5 


3.0 


443 


2 


T21598 


725 


82.5 


3.0 


456 


2 


C48572 


726 


82.5 


3.0 
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2 


T29968 
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82 . 5 
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2 


A82083 
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2 


T03295 
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2 


F81287 
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2 


T12124 


731 
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724 


2 


C83706 
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82.5 


3.0 
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1 


VCLJKX 


733 
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3.0 
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2 


T13792 


734 


82 . 5 


3.0 
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2 


T17469 
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82.5 


3.0 


738 


2 


T13396 
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741 


2 


T13404 
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T12757 
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2 


T13378 
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T13493 
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2 


JQ2209 
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1120 


2 


H71664 


748 


82.5 


3.0 


1124 


2 


S06856 


749 


82.5 


3.0 


1291 


2 


T22382 


750 


82.5 


3.0 


1335 


2 


T18289 
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82.5 


3.0 


4452 


1 


YGBSG2 
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82 


3.0 


54 


2 


A46292 
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82 
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2 


S74012 
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2 
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755 


82 


3.0 


388 


2 


F89773 


hypothetical prote 


756 


82 


3.0 


413 


2 


.S52020 


translation initia 


757 


82 


3.0 


436 


2 


T38812 


hypothetical prote 


758 


82 


3.0 


437 


2 


T48035 


hypothetical prote 


759 


82 


3.0 


437 


2 


T39592 


phosphatidyl serine 


760 


82 


3.0 


442 


2 


D72498 


probable serine hy 


761 


82 


' 3.0 


462 


2 


JC7805 


toxin-A - Chiropsa 


762 


82 


3.0 


517 


2 


F64560 


CMP-N-acetylneuram 


763 


82 


3.0 


535 


2 


A71319 


probable glucose-6 


764 


82 


3.0 


539 


2 


T32052 


hypothetical prote 


765 


82 


3.0 


553 


2 


C81024 


hypothetical prote 


766 


82 


3.0 


574 


2 


A86255 


hypothetical prote 


767 


82 


3.0 


602 


2 


S72513 


FOG2 protein - yea 


768 


82 


3.0 


646 


2 


B70396 


histidine kinase s 


769 


82 


3.0 


683 


2 


T12295 


NADH2 dehydrogenas 


770 


82 


3.0 


684 


2 


T13695 


NADH2 dehydrogenas 


771 


82 


3.0 


690 


2 


T12763 


NADH2 dehydrogenas 


772 


82 


3.0 


710 


2 


T12143 


NADH2 dehydrogenas 


773 


82 


3.0 


714 


2 


G86844 


hypothetical prote 


774 


82 


3.0 


741 


2 


T12422 


NADH2 dehydrogenas 


775 


82 


3.0 


883 


2 


T23948 


hypothetical prote 


776 


82 


3.0 


913 


2 


T31497 


hypothetical prote 


777 


82 


3.0 


964 


2 


G96662 


probable aminopept 


778 


82 


3.0 


1005 


2 


F90099 


hypothetical prote 


779 


82 


3.0 


1022 


2 


S50534 


hypothetical prote 


780 


82 


3.0 


1024 


2 


G72041 


exodeoxyribonuclea 


781 


82 


3.0 


1024 


2 


F86582 


exodeoxyribonuclea 


782 


82 


3.0 


1024 


2 


D81624 


exodeoxyribonuclea 


783 


82 


3.0 


1047 


2 


E64790 


ybdE protein - Esc 


784 


82 


3.0 


1526 


2 


T41522 


myosin ii - fissio 


785 


82 


3.0 


1957 


2 


T38077 


hypothetical coile 


786 


82 


3.0 


3005 


1 


GNVSTV 


genome polyprotein 


787 


81.5 


2.9 


305 


2 


T32235 


hypothetical prote 


788 


81.5 


2.9 


306 


2 


G96935 


mccF-like protein 


789 


81.5 


2.9 


341 


2 


E71191 


probable 3-hydroxy 


790 


81.5 


2.9 


360 


2 


C72356 


hypothetical prote 


791 


81.5 


2.9 


370 


2 


D83793 


hypothetical prote 
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81.5 


2.9 


418 


2 


B72245 


conserved hypothet 
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81.5 


2.9 


421 


2 


S73010 


hypothetical prote 
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81.5 


2.9 


426 


2 


F70723 


probable trans f era 


795 


81.5 


2.9 


429 


2 


E84952 


threonine synthase 
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81.5 


2.9 


473 


2 


T24686 


hypothetical prote 
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81.5 


2.9 


479 


1 


A59380 


protein kinase (EC 


798 


81.5 


2.9 


525 


2 


T41427 


membrane transport 


799 


81.5 


2.9 


544 


2 


T51162 


aminoglycoside ace 


800 


81.5 


2.9.. 


572 


2 


T51525 


hypothetical prote 


801 


81.5 


2.9 


602 


2 


S39782 


cyclooxygenase 1 - 


802 


81.5 


2.9 


623 


2 


G72412 


hypothetical prote 


803 


81.5 


2.9 


638 


2 


H83905 


hypothetical prote 


804 


81.5 


2.9 


697 


2 


T39512 


hypothetical prote 


805 


81.5 


2.9 


698 


2 


T13657 


NADH2 dehydrogenas 
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81.5 


2.9 


715 


2 


S77439 


hypothetical, prote 


807 


81.5 


2.9 


717 


1 


VGBE11 


glycoprotein H pre 


808 


81.5 


2.9 


739 


2 


JS0675 


vascular cell adhe 


809 


81.5 


2.9 


741 


2 


T12610 


NADH2 dehydrogenas 


810 


81.5 


2.9 


741 


2 


T13085 


NADH2 dehydrogenas 


811 


81.5 


2.9 


741 


2 


T12699 


NADH2 dehydrogenas 
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81.5 


2.9 


741 


2 


T13051 


NADH2 dehydrogenas 


813 


81.5 


2.9 


741 


2 


T13233 


NADH2 dehydrogenas 


814 


81.5 


2.9 


741 


2 


T13569 


NADH2 dehydrogenas 


815 


81.5 


2.9 


741 


2 


T13777 


NADH2 dehydrogenas 


816 


81.5 


2.9 


744 


2 


G90124 


hypothetical prote 


817 


81.5 


2. 9 


794 


2 


S64362 


probable membrane 


818 


81.5 


2.9 


825 


2 


T23612 


hypothetical prote 


819 


81.5 


2.9 


836 


2 


A54269 


protein-glutamine 


820 


81 . 5 


2. 9 


1161 


2 


B86368 


protein F28C11.9 [ 


821 


81.5 


2.9 


1522 


2 


S48904 


probable purine nu 


822 


81.5 


2. 9 


1545 


1 


S71841 


multidrug resistan 


823 


81 


2.9 


296 


2 


E71118 


hypothetical prote 


824 


81 


2.9 


299 


2 


H90033 


hypothetical prote 


825 


81 


2 . 9 


355 


2 


S52022 


translation initia 


826 


81 


2.9 


364 


2 


S44899 


ZK1236.4 protein - 


827 


81 


2. 9 


413 


2 


S52017 


translation initia 


828 


81 


2.9 


473 


2 


A56377 


rubber particle cy 


829 


81 


2.9 


492 


2 


T38093 


probable protein d 


830 


81 


2 . 9 


503 


2 


S14275 


steroid 6beta-mono 


831 


81 


2. 9 


509 


2 


T37587 


hypothetical trp-a 


832 


81 


2 . 9 


532 


2 


S54571 


probable membrane 
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81 


2. 9 


541 


2 


A70141 


oligopeptide ABC t 


834 


81 


2. 9 


554 


2 


T13500 


NADH2 dehydrogenas 


835 


81 


2. 9 


649 


2 


T45102 


H+-transporting tw 


836 


81 


2. 9 


688 


2 


T48796 


probable ATP-depen 


837 


81 


2. 9 
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2 


T13280 


NADH2 dehydrogenas 
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81 


2 . 9 


698 


2 


T30948 


hypothetical prote 


839 


81 


2. 9 


700 


2 


T13726 


NADH2 dehydrogenas 


840 


81 


2 . 9 


701 


2 


T12296 


NADH2 dehydrogenas 


841 


81 


2 . 9 


704 


2 


T13581 


NADH2 dehydrogenas 


842 


81 


2 . 9 


726 


2 


T12215 


NADH2 dehydrogenas 


843 


81 


2 . 9 


797 


2 


A70453 


glutamate ammonia 


844 


81 


2 . 9 


807 


2 


S51460 


hypothetical prote 


845 


81 


2.9 


820 


2 


T46412 


ubiqui tin-protein 
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81 


2 . 9 


894 


2 


D82127 


alcohol dehydrogen 
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81 
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906 


2 


T28034 


hypothetical prote 


848 


81 
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920 


2 


F64697 


isoleucine— tRNA li 


849 


81 


2. 9 


964 


2 


S57379 


MSH2 protein - yea 


850 


81 


2.9 


1024 


2 


C64208 


hypothetical prote 


851 


81 


2.9 


1122 


2 


A97814 


transcript ion- repa 


852 


81 


2. 9 


1124 


2 


F71719 


hypothetical prote 


853 


81 


2 . 9 


1126 


1 


WMFM12 


125K protein - alf 


854 


81 


2. 9 


1253 


2 


S62544 


hypothetical prote 


855 


81 


2.9 


1544 


2 


G96904 


DNA segregation AT 


856 


81 


2. 9 


1678 


2 


S52588 


clathrin heavy cha 


857 


81 


2.9 


1986 


2,. 


..S28353 


probable polyketid 


858 


81 


2. 9 


3084 


1 


MMMSA 


laminin alpha-1 ch 


859 


80.5 


2.9 


339 


2 


B72402 


UDP-N-acetylglucos 


860 


80.5 


2.9 


345 


2 


D70564 


hypothetical prote 


861 


80.5 


2.9 


360 


2 


G64686 


conserved hypothet 


862 


80.5 


2.9 


375 


2 


S63442 


probable protein k 


863 


80.5 


2.9 


398 


2 


A37274 


glia-derived nexin 


864 


80.5 


2.9 


403 


2 


A71484 


probable phosphogl 


865 


80.5 


2.9 


423 


2 


AD2455 


hypothetical prote 


866 


80.5 


2.9 


430 


2 


S77279 


hypothetical prote 


867 


80.5 


2.9 


435 


2 


A81416 


probable integral 


868 


80.5 


2.9 


435 


2 


H71910 


probable processin 



869 


80.5 


2.9 


439 


2 


A83363 


probable cytochrom 


870 


80.5 


2.9 


446 


2 


H97929 


histidine kinase ( 


871 


80.5 


2.9 


453 


2 


AF2401 


cytochrome P450 [i 


872 


80.5 


2.9 


461 


2 


T47782 


hypothetical prote 


873 


80.5 


2.9 


488 


1 


QXASM4 


NADH2 dehydrogenas 


874 


80.5 


2.9 


491 


2 


AC1499 


transmembrane prot 


875 


80.5 


2.9 


492 


2 


A34272 


testosterone 7alph 


876 


80.5 


2.9 


499 


2 


S53637 


protein kinase elk 


877 


80.5 


2.9 


562 


2 


T29858 


hypothetical prote 


878 


80.5 


2.9 


583 


2 


G83375 


probable glycosyl 


879 


80.5 


2.9 


595 


2 


E82934 


hypothetical prote 


880 


80.5 


2.9 


642 


2 


S64348 


hypothetical prote 


881 


80.5 


2.9 


652 


2 


H86221 


hypothetical prote 


882 


80 . 5 


2 . 9 


698 


2 


T12629 


NADH2 dehydrogenas 


883 


80. 5 


2 . 9 


698 


2 


T12625 


NADH2 dehydrogenas 


884 


80.5 


2.9 


702 


2 


T12624 


NADH2 dehydrogenas 


885 


80 . 5 


2 . 9 


719 


2 


T13793 


NADH2 dehydrogenas 


886 


80.5 


2.9 


741 


2 


T13764 


NADH2 dehydrogenas 


887 


80.5 


2.9 


741 


2 


T13663 


NADH2 dehydrogenas 


888 


80.5 


2 . 9 


744 


2 


T12714 


NADH2 dehydrogenas 


889 


80.5 


2 . 9 


826 


2 


T33796 


hypothetical prote 


890 


80.5 


2. 9 


984 


2 


D70461 


preprotein translo 


891 


80 . 5 


2 . 9 


1025 


2 


159331 


thyrotropin- releas 


892 


80.5 


2 . 9 


1032 


2 


S12153 


pol polyprotein - 


893 


80.5 


2 . 9 


1042 


2 


H75112 


molybdenum cofacto 


894 


80 . 5 


2 . 9 


1301 


2 


T07321 


DNA-directed RNA p 


895 


80 . 5 


2 . 9 


1568 


2 


T41013 


hypothetical prote 


896 


80 


2.9 


217 


2 


E97310 


uncharacterized co 


897 


80 


2 . 9 


285 


2 


D69632 


transcription anti 


898 


80 


2 . 9 


318 


2 


D71501 


hypothetical prote 


899 


80 


2 . 9 


324 


2 


E91186 


probable transcrip 


900 


80 


2 . 9 


324 


2 


D86033 


probable transcrip 


901 


80 


2.9 


333 


2 


S77222 


hypothetical prote 


902 


80 


2 . 9 


366 


2 


C81336 


RodA protein homol 


903 


80 


2 . 9 


394 


2 


B70411 


fimbrial assembly 


904 


80 


2.9 


397 


2 


148717 


proteinase inhibit 


905 


80 


2. 9 


404 


2 


H95937 


hypothetical prote 


906 


80 


2 . 9 


404 


2 


T44600 


hypothetical prote 


907 


80 


2 . 9 


413 


2 


S52018 


translation initia 


908 


80 


2 . 9 


420 


1 


S34379 


glycine hydroxymet 


909 


80 


2.9 


426 


2 


A4.0440 


endothelin 1 and 2 


910 


80 


2 . 9 


466 


2 


H84971 


asparagine-tRNA li 


911 


80 


2.9 


486 


2 


E96757 


protein glycosyl t 


912 


80 


2.9 


494 


2 


JC5320 


cytochrome P450 mo 


913 


80 


2 . 9 


510 


1 


A56926 


activin receptor I 


914 


80 


2 . 9 


545 


2 


T40207,„ .... 


hypothetical prote 


915 


80 


2.9 


567 


2 


E72156 


C6R protein - vari 


916 


80 


2.9 


567 


2 


136841 


E6R protein - vari 


917 


80 


2.9 


567 


2 


T28485 


hypothetical prote 


918 


80 


2.9 


610 


1 


DCZMP 


pyruvate decarboxy 


919 


80 


2 . 9 


640 


2 


F71527 


probable transketo 


920 


80 


2.9 


663 


2 


T41963 


hypothetical prote 


921 


80 


2.9 


685 


2 


AF0850 


hypothetical prote 


922 


80 


2.9 


686 


2 


T12607 


NADH2 dehydrogenas 


923 


80 


2.9 


686 


2 


T12128 


NADH2 dehydrogenas 


924 


80 


2.9 


697 


2 


T13670 


NADH2 dehydrogenas 


925 


80 


2.9 


698 


2 


T12713 


NADH2 dehydrogenas 



926 


80 


2.9 


698 


2 


T12567 


NADH2 dehydrogenas 


927 


80 


2.9 


- 700 


2 


T13661 


NADH2 dehydrogenas 


928 


80 


2.9 


741 


2 


T13705 


NADH2 dehydrogenas 


929 


80 


2.9 


744 


2 


T13399 


NADH2 dehydrogenas 


930 


80 


2.9 


760 


2 


T34414 


hypothetical prote 


931 


80 


2.9 


775 


2 


E70320 


polyribonucleotide 


932 


80 


2.9 


789 


2 


T09672 


ent-kaurene syntha 


933 


80 


2.9 


826 


1 


T02753 


S-receptor kinase 


934 


80 


2.9 


906 


2 


B96901 


uncharacterized co 


935 


80 


2.9 


920 


2 


T10052 


aminopeptidase (EC 


936 


80 


2.9 


950 


2 


T09076 


hypothetical prote 


937 


80 


2.9 


1183 


2 


F90559 


conserved hypothet 


938 


80 


2.9 


1356 


2 


S51389 


ROM2 protein - yea 


939 


80 


2.9 


1620 


2 


E83261 


conserved hypothet 


940 


80 


2.9 


2108 


2 


S28417 


CDC39 protein - ye 


941 


80 


2.9 


2178 


2 


S55805 


alpha-toxin - Clos 


942 


80 


2.9 


3744 


2 


S46715 


hypothetical prote 


943 


80 


2.9 


5232 


2 


A45086 


HC-toxin synthetas 


944 


80 


2.9 


7962 


2 


138346 


elastic titin - hu 


945 


79.5 


2.9 


271 


2 


G84030 


hemA concentration 


946 


79.5 


2.9 


292 
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2.7 


743 


2 


T13700 


NADH2 dehydrogenas 



1439 


75.5 


2.7 


744 


2 


T13376 


NADH2 dehydrogenas 


1440 


75.5 


2.7 


751 


2 


S38101 


hypothetical prote 


1441 


75.5 


2.7 


756 


2 


T12697 


NADH2 dehydrogenas 


1442 


75. 5 


2.7 


782 


2 


D81281 


probable nucleotid 


1443 


75. 5 


2.7 


787 


2 


S72725 


guanosine-3 1 , 5 1 bis 


1444 


75. 5 


2.7 


838 


2 


T09054 


capsaicin receptor 


1445 


75.5 


2.7 


848 


2 


B84107 


hypothetical prote 


1446 


75.5 


2.7 


875 


2 


D81651 


conserved hypothet 


1447 


75. 5 


2.7 


915 


1 


A55144 


autotaxin precurso 


1448 


75. 5 


2.7 


928 


2 


T52292 


endopeptidase Clp 


1449 


75.5 


2.7 


949 


2 


H97322 


DNA/RNA helicase, 


1450 


75.5 


2.7 


971 


2 


A70179 


exodeoxyribonuclea 


1451 


75. 5 


2.7 


975 


2 


AC2517 


hypothetical prote 


1452 


75. 5 


2.7 


1022 


2 


153078 


homeotic gene regu 


1453 


75. 5 


2.7 


1031 


2 


T06130 


hypothetical prote 


1454 


75.5 


2.7 


1155 


2 


H71456 


probable pyrolysin 


1455 


75. 5 


2.7 


1178 


2 


S44142 


VLA-2 protein homo 


1456 


75. 5 


2.7 


1225 


2 


T39255 


probable C2 domain 


1457 


75.5 


2.7 


1285 


2 


B72420 


hypothetical prote 


1458 


75.5 


2.7 


1318 


1 


HIBPD7 


internal virion pr 


1459 


75.5 


2.7 


1430 


2 


S50596 


hypothetical prote 


1460 


75. 5 


2.7 


1450 


2 


A84780 


probable ABC trans 


1461 


75. 5 


2 . 7 


1605 


2 


T31435 


DNA-directed RNA p 


1462 


75. 5 


2.7 


1647 


2 


S45252 


SNF2beta protein - 


1463 


75. 5 


2.7 


1657 


2 


T25421 


hypothetical prote 


1464 


75 . 5 


2 . 7 


1764 


2 


S37827 


hypothetical prote 


1465 


75 . 5 


2.7 


1874 


1 


JQ0533 


genome polyprotein 


1466 


75.5 


2.7 


2022 


2 


A59256 


myosin-IXb [simila 


1467 


75 . 5 


2 . 7 


2136 


2 


A05037 


hypothetical prote 


1468 


75. 5 


2.7 


2471 


2 


T03820 


probable histidine 


1469 


75.5 


2.7 


2748 


2 


S57976 


nuclear migration 


1470 


75. 5 


2.7 


8243 


2 


T31307 


type I fatty acid 


1471 


75 


2.7 


249 


2 


E69298 


conserved hypothet 


1472 


75 


2.7 


263 


2 


F45734 


orf6 3* to hisD - 


1473 


75 


2 . 7 


288 


2 


G86223 


hypothetical prote 


1474 


75 


2.7 


295 


1 


H70031 


conserved hypothet 


1475 


75 


2.7 


295 


2 


G85042 


hypothetical prote 


1476 


75 


2.7 


312 


2 


D90198 


transposase ISC123 


1477 


75 


2.7 


314 


2 


S75872 


hypothetical prote 


1478 


75 


2.7 


325 


2 


F91024 


NADH dehydrogenase 


1479 


75 


2.7 


325 


2 


G85868 


NADH dehydrogenase 


1480 


75 


2.7 


325 


2 


H64999 


NADH2 dehydrogenas 


1481 


75 


2.7 


327 


2 


D82090 


conserved hypothet 


1482 


75 


2 . 7 


333 


2 


AG2121 


lucif erase-alpha c 


1483 


75 


2.7 


350 


2 


AH3043 


dehydrogenase Atu3 


1484 


75 


2.7 


350 


2_ 


. D98242 


lipopolysaccharide 


1485 


75 


2.7 


357 


2 


C72022 


UDP-N-acetylglucos 


1486 


75 


2.7 


357 


2 


F86603 


peptidoglycan tran 


1487 


75 


2.7 


372 


2 


138042 


single-stranded DN 


1488 


75 


2.7 


376 


2 


AE1878 


phospho-2-dehydro- 


1489 


75 


2.7 


391 


2 


A89870 


hypothetical prote 


1490 


75 


2.7 


392 


1 


SYPJCB 


naringenin-chalcon 


1491 


75 


2.7 


400 


1 


F69142 


probable hexosyltr 


1492 


75 


2.7 


404 


2 


T19445 


hypothetical prote 


1493 


75 


2.7 


404 


2 


C64332 


hypothetical prote 


1494 


75 


2.7 


408 


2 


D95392 


protein [imported 


1495 


75 


2.7 


411 


2 


G97802 


tyrosine-tRNA liga 



1496 


75 


2.7 


414 


2 


D70723 


probable trans f era 


1497 


75 


2.7 


,416 


2 


F81337 


RNA polymerase sig 


1498 


75 


2.7 


419 


2 


E64526 


hypothetical prote 


1499 


75 


2.7 


432 


2 


G97224 


ATP-dependent prot 


1500 


75 


2.7 


448 


2 


T43624 


beta tubulin, temp 



ALIGNMENTS 



RESULT 1 
JC5423 

2-hydroxyacylsphingosine 1-beta-galactosyltransf erase (EC 2.4.1.45) - human 
N; Alternate names: UDP-galactose : ceramide galactosyltransf erase 
C; Species: Homo sapiens (man) 

C;Date: 10-Jun-1997 #sequence_revision 18-Jul-1997 #text_change 26-Aug-1999 
C;Accession: JC5423 
R;Kapitonov, D. ; Yu, R.K. 

Biochem. Biophys . Res. Commun. 232, 449-453, 1997 

A; Title: Cloning, characterization, and expression of human ceramide 
galactosyltransf erase cDNA. 

A;Reference number: JC5423; MUID : 97242209 ; PMID:9125199 

A; Accession: JC5423 

A; Molecule type: mRNA 

A; Residues: 1-541 <KAP> 

A/Cross-references : GB:U62899 

A; Experimental source: fetal glioma cell 

C; Comment: This enzyme catalyzes the final step of galactosylceramide synthesis. 
C; Genetics : 
A; Gene: cgt 

C; Superf amily : glucuronosyltransf erase 

C; Keywords : glycoprotein; glycosyltransf erase; hexosyltransf erase 
F;472-492/Domain: hydrophobic #status predicted <HYD> ' 

F; 538-540/Region : endoplasmic reticulum retention signal #status atypical 
F; 78, 333, 442/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 25.3%; Score 700; DB 2; Length 541; 

Best Local Similarity 32.2%; Pred. No. 8.7e-47; 

Matches 166; Conservative 95; Mismatches 185; Indels 70; Gaps 12; 

Qy 13 LLPGVTjLSEAAKILTISTV-GGSHYLLMDRVSQILQDHGHN-VTMLNHKRGPFMPDFKKE 70 

I | : : : | I | | : : : II : : : I : I I : I : I : I 
Db 11 LWSAVGIAKAAKIIIVPPIMFESHMYIFKTLASALHERGHHTVFLLSEGRD 61 

Qy 71 EKS YQVI SWLAPEDHQ — REFKKSF DFFLEETL GGRGKFENLLNVLEYLAL 119 

: I I : I : : J Ill:,: II I : : I : : 

Db 62 IAPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLTAIELFDILDHYTK 112 

Qy 120 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

I : : : I I I I I : : : I : I I I : I I II : : : III 
Db 113 NCDMMVGNHALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPA 172 

Qy 180 PLSYVPVFRSLLTDHMDFWGRVKN FLMFFS FCRRQQHM QSTFD 222 

I I : II I I II I I I I : I : I I I I : : I I : I : I 

Db 173 PLAYVPEFNSLLTDRMNLLQRMKNTGVYLISRLGVS FLVLPKYERIMQKYNLLPEKSMYD 232 

Qy 223 NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 



I : : I I : : I I : I I I I I I I I I I I : : I I I : I : I 
Db 233 LVHGSSLWMLCTDVALEFPRPTLPNVVYVGGI LTKPASPLPED 275 

Qy 283 LENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHL 342 

I : : : : I I I I I : I : I : : I : : I III III: I I : I 

Db 276 LQRWVNGANEHGFVLVS FGAGVKYL-SEDIANKLAGALGRLPQKVIWRFS GPKPKNL 331 

Qy 343 AANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVE 402 

I I : :: I I I I : I I I I I : 11 = 111 III I : I I I I : I I I I : I I I : I I I : 
Db 332 GNNTKLI EWLPQNDLLGHSKI KAFVSHGGLNS I FETMYHGVPWGI PVFGDH YDTMTRVQ 391 

Qy 403 AKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHV 462 

II I : : : I : : I : : : : : | : | | | : | | | : | | | : : 

Db 392 AKGMGILLEWKTVTEKELYEALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTIYWIDYI 451 

Qy 463 LQTGGATHLKPYVFQQPWHEQYLFDV- FVFLLGLTL 497 

:: MM: | | : : : I I : I I I I I I 
Db 452 I RHNGAHHLRAAVHQI S FCQYFLLDIAFVLLLGAAL 487 



RESULT 2 
A48801 

2-hydroxyacylsphingosine 1-beta-galactosyltransf erase (EC 2.4.1.45) precursor - 
rat 

N;Alternate names: UDPgalactose-ceramide galactosyltransf erase 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 07-Apr-1994 #sequence_revision 18-Nov-1994 #text_change 09-Jul-2004 
C;Accession: A48801; 156576; S63480 
R;Schulte, S.; Stoffel, W. 

Proc. Natl. Acad. Sci. U.S.A. 90, 10265-10269, 1993 

A;Title: Ceramide UDPgalactosyltransf erase from myelinating rat brain: 
purification, cloning, and expression. 

A; Reference number: A48801; MUID: 94052143; PMID: 7694285 
A;Accession: A48801 
A; Status: preliminary 
A;Molecule type: mRNA; protein 
A; Residues: 1-541 <SCH> 

A; Cross-references: UNIPROT : Q09426; GB:L21698; NID:g437665; PIDN: AAA1 6108 . 1; 
PID:g437666 

A; Experimental source: brain 

A;Note: sequence extracted from NCBI backbone (NCBIN : 139520, NCBIP : 139522 ) 
R;Stahl, N . ; Jurevics, H. ; Morell, P.; Suzuki, K. ; Popko, B. 
J. Neurosci. Res. 38, 234-242, 1994 

A;Title: Isolation, characterization, and expression of cDNA clones that encode 
rat UDP-galactose: ceramide galactosyltransf erase . 
A;.Reference number: 156576; MUID: 94358923; PMID: 7521399 
A;Accession: 156576 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-541 <RES> 

A;Cross-references: EMBL:U07683; NID:g464025; PIDN :AAA5 0212 . 1 ; PID:g464026 

R;Schulte, S.; Stoffel, W. 

Eur. J. Biochem. 233, 947-953, 1995 

A; Title: UDP galactose : ceramide galactosyltransf erase and glutamate/aspartate 
transporter: copurif ication, separation and characterization of the two 
glycoproteins . 

A; Reference number: S63480; MUID : 96085162 ; PMID: 8521863 



A; Accession: S634 80 

A; Molecule type: protein 

A; Residues: 21-28, 'A 1 , 30-31, 'Q' , 33-39; 73-77, » X 79-87 ; 155-166, f Q ! ,168-173 ; 315- 
322; 330-331, 1 EX 1 ,334-338, 1 Q 1 , 340-353 ; 416-423; 510-515 <SUL> 
A; Experimental source: brain 
C; Function: 

A; Description: transfers galactose from UDP-galactose to ceramide 
C; Superfamily: glucuronosyltransf erase 

C; Keywords : glycoprotein; glycosyltransf erase; hexosyltransf erase 
F; 1-20/ Domain : signal sequence #status predicted <SIG> 

F;21-541/Product : 2-hydroxyacylsphingosine 1-beta-galactosyltransf erase ftstatus 
experimental <MAT> 

F;78, 333/Binding site: carbohydrate (Asn) (covalent) #status experimental 

Query Match 24.7%; Score 684; DB 2; Length 541; 

Best Local Similarity 32.2%; Pred. No. 1.6e-45; 

Matches 165; Conservative^ 94; Mismatches 184; . Indels 70; Gaps 12; 

Qy 13 LLPGVLLSEAAKILTISTV-GGSHYLLMDRVSQILQDHGHN-VTMLNHKRGPFMPDFKKE 70 

I | : : I I I I : : : II : : : I : I I : I : I : I 
Db 11 LWSAVGIARAAKI I IVPPIMFESHLYI FKTLASALHERGHHTVFLLSEGRD 61 

Qy 71 EKS YQVI SWLAPEDHQ — REFKKSF DFFLEETL GGRGKFENLLNVLEYLAL 119 

: I : I : : I III:: II | : : : I : : 

Db 62 IDPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLTAVELVDILDHYTK 112 

Qy 120 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

I : : : : I I I I I : : : I : I I I : I I II : : : III 
Db 113 NCDMMVGNQALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPA 172 

Qy 180 PLSYVPVFRSLLTDHMDFWGRVKN FLMFFS FCRRQQH MQSTFD 222 

I I : I I I I I I I I I I : I I : I I I I : : I I : I : I 

Db 173 PLAYVPEFNSLLTDRMNFLERMKNTGVYLISRMGVSFLVLPKYERIMQKYNLLPAKSMYD 232 

Qy 223 NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I : : I I : : I I : I I I I I I I I I I I : : I I I : I : I 
Db 233 LVHGS SLWMLCTDVALEFPRPTLPNVVYVGGI LTKPAS PLPED 275 

Qy 283 LENFIAKFGDSGFVTiVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHL 342 

I : : : : I I I I I : I : I : : I : : I III III: : I : I 
Db 276 LQRWVXIGAQEHGFVLVSFGAGVKYL-SEDIANKLAGALGRLPQKVIWRFSGT KPKNL 331 

Qy 343 AANWIVDWLPQSDLIiAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMW 402 

I | : : : | | | | : I I I I : I I I : : I I I I II I : I I I I : I I I I I I I I : I I I : 
Db 332 GNNTKLIEWLPQNDLLGHSNIRAFLSHGGLNSIFETMYHGVPWGIPLFGDHYDTMTRVQ 391 

Qy 4 03 AKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHV 462 

|| | : : : : I : : : : : I : I I I : I I I III:: 

Db 392 AKGMGILLEWNTVTEGELYDALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTTYWIDYI 451 

Qy 463 LQTGGATHLKP YVFQQPWHEQYLFDV- FVFLLG 494 

I : II ||: II : : : I I : ' I I I I I 
Db 452 LRHDGAHHLRSAVHQISFCQYFLLDIAFVLLLG 484 



RESULT 3 
A39092 



glucuronosyltransf erase (EC 2.4.1.17) 1 precursor, bilirubin-specif ic - human 
N;Alternate names: bilirubin UDP-glucuronosyltransf erase 
C; Species: Homo sapiens (man) 

C;Date: 17-Jul-1992 #sequence_revision 17-Jul-1992 #text_change 09-Jul-2004 
C;Accession: A39092; E42586; A48887 
R;Ritter, J.K.; Crawford, J.M.; Owens, I.S. 
J. Biol. Chem. 266, 1043-1047, 1991 

A; Title: Cloning of two human liver bilirubin UDP-glucuronosyltransf erase cDNAs 
with expression in COS-1 cells. 

A; Reference number: A39092; MUID: 91093210 ; PMID: 1898728 
A;Accession: A39092 
A;Molecule type: mRNA 
A; Residues: 1-533 <RIT> 

A; Cross-references: UNIPROT : P22309; GB:M57899; NID:gl84472; PIDN : AAA63195 . 1 ; 
PID:gl84473 

R;Ritter, J.K.; Chen, F. ; Sheen, Y.Y.; Tran, H.M.; Kimura, S.; Yeatman, M.T.; 
Owens , I.S. 

J. Biol. Chem. 267, 3257-3261, 1992 

A;Title: A novel complex locus UGT1 encodes human bilirubin, phenol, and other 
UDP-glucuronosyltransf erase isozymes with identical carboxyl termini. 
A; Reference number: A42586; MUID: 92147680 ; PMID: 1339448 
A; Accession: E42586 

A; Status: preliminary; not compared with conceptual translation 
A;Molecule type: nucleic acid 
A; Residues: 1-288 <RI2> 

A; Cross-references: GB:M84125; NID:g340131; PIDN : AAA61248 . 1 ; PID:g340132 
A;Note: sequence extracted f rom NCBI backbone (NCBIP : 81433) 
R;Ritter, J.K.; Yeatman, M.T.; Kaiser, C. ; Gridelli, B.; Owens, I.S. 
J. Biol. Chem. 268, 23573-23579, 1993 

A; Title: A phenylalanine codon deletion at the UGT1 gene complex locus of a 
Crigler-Naj jar type I patient generates a pH-sensitive bilirubin UDP- 
glucuronosyltransf erase . 

A;Reference number: A48887; MUID : 94043159; PMID:8226884 
A; Accession: A48887 

A; Status: preliminary; not compared with conceptual translation 

A; Molecule type: DNA 

A; Residues: 161-170,172-180 <RI3> 

A; Experimental source: liver, Crigler-Naj jar type I patient 
A;Note: sequence extracted from NCBI backbone (NCBIP : 138934 ) 
C; Genetics : 

A; Gene: GDB:UGT 1A1; UGT1 

A;Cross-references: GDB: 120007; OMIM: 191740 

A; Map position: 2q37-2q37 

C; Super family : glucuronosyltransf erase 

C; Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 

Query Match 24.5%; Score 679; DB 2; Length 533; 

Best Local Similarity 33.8%; Pred. No. 3.8e-45; 

Matches 175; Conservative 93; Mismatches 211; Indels 38; Gaps 13; 

Qy 8 LLVGFLL — PGVLLSEAAKILTI STVGGSHYLLMDRVSQI LQDHGHNVTMLNHKRGPFMP 65 

I : : I I I I :: I I I I I I I I I I : I I III I I : : I I 
Db 11 LVLGLLLCVLGPWSHAGKILLI-PVDGSHWLSMLGAIQQLQQRGHEIWL AP 62 

Qy 66 D FKKEEKS YQVT SWLAPEDHQRE- FKKS F DFFLEETLGGRGKFENLL 111 

I : : : I : : : I Nihil III:: : : : 

Db 63 DASLYIRDGAFYTLKTY — PVPFQREDVKESFVSLGHNVFENDSFLQRVI KTYKKIK 117 



Qy 112 NVLEYLALQCSHFLNRKD.IMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG 171 

I I I I I : t : : I I I : | | : : : : I I : : I : I II I 
Db 118 KDSAMLLSGCSHLLHNKEIJyiASLAESSFDVMLTDPFLPCSPIVAQYLSLPTVFFLHALPC 177 

Qy 172 SLEF GLP I PLS YVPVFRS LLTDHMDFWGRVKN FLMFFS FCRRQQHMQST FDNT I KEH 228 

I I I I I I I I I I I : I I I I I I I I I : I I : | : | : 

Db 178 SLEFEATQCPNPFSWPRPLSSHSDHMTFLQRVKNMLIAFSQNFLCDVVYSPY-ATLASE 236 

Qy 229 FTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIA 288 

I : : II I : I III I : I I : : I I I : I I I : I : I : I : I 

Db 237 FLQ-REVTVQDLLSSASVWLFRSDFVKDYPRPIMPNMVFVGGINCLHQNPLSQEFEAYIN 295 

Qy 289 KFGDSGFVIjWLGSMWTCQNPEIFKEMNNAFAHLPQGVIW 348 

I : I I : : I I I I I : : : : I : I I I : I : : : : I I I : 
Db • 296 ASGEHGI WFSLGSMVSEI PEKKAM-AIADALGKI PQTVLWRYTGT RPSNLANNTIL 351 

Qy 349 VDWLPQSDLLAHPS I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGV 408 

I I I I I : II I I I I I : I I I : : I : I : I I I I I : I I I I I I : I I : I I II 
Db 352 VKWLPQNDLLGHPMTRAFITHAGSHGWESICNGVPM^ 411 

Qy 409 SIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGA 468 

:::::: I I : | : : I I I I : I : : I : I I I : : I : : II 
Db 412 TLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGA 471 

Qy 469 THLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

I I : I I : : : I I I I I : I : : I 

Db 472 PHLRPAAHDLTWYQ YHS LDVI GFLLAVVLTVAFI T FK 508 



RESULT 4 
A404 67 

glucuronosyltransf erase (EC 2.4.1.17) precursor - rat 
N;Alternate names: UDP-glucuronosyltransf erase isoform 53K 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 07-Feb-1992 #sequence_revision 07-Feb-1992 #text_change 09-Jul-2004 
C;Accession: A40467; A23520; S59627; A26064; 155247 
R;Haque, S.J.; Petersen, D.D.; Nebert, D.W.; Mackenzie, P.I. 
DNA Cell Biol. 10, 515-524, 1991 

A; Title: Isolation, sequence, and developmental expression of rat UGT2B2: the 
gene encoding a constitutive UDP glucuronosyltransf erase that metabolizes 
etiocholanolone and androsterone . 

A; Reference number: A40467; MUID: 91369480; PMID: 1909872 
A;Accession: A40467 
A; Status: preliminary 

A; Molecule type: DNA ..... - . 

A; Residues: 1-530 <HAQ> 

A; Cross-references : UNIPROT : P08541 

R; Jackson, M.R.; Burchell, B. 

Nucleic Acids Res. 14, 779-795, 1986 

A; Title: The full length coding sequence of rat liver androsterone UDP- 
glucuronyltransf erase cDNA and comparison with other members of this gene 
family. 

A; Reference number: A23520; MUID: 86120371 ; PMID:3003696 
A; Accession: A23520 
A;Molecule type: mRNA 



» 



A;Residues: 31-158, 'E', 160-285, *S ! , 287-350, 'I 1 , 352-362, , I»,364-430, "E" ,432-530 
<JAC> 

A;Cross-references: GB:X03478; NID:g57452; PIDN : CAA27198 . 1 ; PID:g57453 
A; Note: the authors translated the codon ATT for residue 321 as Asn 
R;Yamashita, A.; Watanabe, M. ; Tonegawa, T. ; Sugiura, T . ; Waku, K. 
Biochem. J. 312, 301-308, 1995 

A;Title: Acyl-CoA binding and acylation of UDP-glucuronosyltransf erase isoforms 

of rat liver: their effect, on enzyme activity. 

A;Reference number: S59626; MUID: 96077159; PMID:7492328 

A; Accession: S59627 

A;Molecule type: protein 

A; Residues: 24-44 <YAM> 

R;Mackenzie, P.I. 

J. Biol. Chem. 261, 14112-14117, 1986 

A; Title: Rat liver UDP-glucuronosyltransf erase. cDNA sequence and expression of 

a form glucuronidating 3-hydroxyandrogens . 

A;Reference number: A26064; MUID: 87033594 ; PMID:2429951 

A;Accession: A26064 

A;Molecule type: mRNA 

A;Residues: 1-430, ' E ' , 432-530 <MAC> 

A;Cross-references: GB:J02589; NID:g207582; PIDN : AAA42314 . 1 ; PID:g207583 

A; Experimental source: hepatic 

C; Superf amily : glucuronosyltransf erase 

C; Keywords: glycosyltransf erase; hexosyltrans f erase; transmembrane protein 
F; 1-23/Domain: signal sequence ftstatus predicted <SIG> 
F;24-530/Product : glucuronosyltransf erase #status experimental <MAT> 

Query Match .24.4%; Score 674.5; DB 2; Length 530; 

Best Local Similarity 33.5%; Pred. No. 8.4e-45; 

Matches 170; Conservative 86; Mismatches 219; Indels 33; Gaps 13; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVISW-LAPEDHQREFKKS 92 

II:::: I I I I I : I I : I : : : : I : : : : I I I 

Db 34 SHWMNIKIILDELVQRGHEVTVLKPSAYFFLDPKKSSDLKFEIFSTSISKDELQNHFIKL 93 

Qy 93 FDFFLEE TLGGRGKFENLLNVLEYLALQ-CSHFLNRKDIMDSLKNENFDMVIVET 146 

I : I I : I I : III : : I : I I : I I : : : 

Db 94 LDVWTYELPRDTCLSYSPILQNLVYEFSYFYLSICKDAVSNKQLMTKLQESKFDVLFADP 153 

Qy 147 FDYCPFLIAEKLGKPFVAILSTSFG-SLEFGLP IPLSYVPVFRSLLTDHMDFWGRVK 202 

I I I I I I II: I I I I II : : I I II II II II III 

Db 154 VASCGDLIAELLHIPFLYSLSFSPGHKLEKSIGKFILPPSYVPVILSGLAGKMTFIDRVK 213 

Qy 203 NF— LMFFSF-CRRQQHMQ-STFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFA 258 

I : : : I I I : I : I I : I I : : I I : I I I : I 

Db 214 NMICMLYFDFWFERLRHKEWDTEYSEIL GRPTTVDETMSKVEIWLIRSYWDLKFP 268 

Qy 259 RPLLPNTVWGGmEKPIKPVPQDLENFIAKFGDSGFVXWLGSMWTCQNPEIFKE^INN 318 

I I I I I : I I I I I I I : I : I : I I : I : I I : : I I I I I : : : I I 

Db 269 HPTLPNVDYIGGLHCKPAKPLPKDMEEFVQSSGEHGVWFSLGSMVS NMTEEKAN 323 

Qy 319 AFAHLPQGVIWKCQCSHWPKDVIiLAANvTaVDWLPQS 374 

I I : I I I : I I I I : : I I I I : I I I I I : I I I I I I I 

Db 324 AIAWALAQI PQKVLWKFD GKTPATLGPNTRVYKWLPQNDLLGHPKTKAFVTHGGANG 380 

Qy 375 IMEMQHGVPMVGIPLFGDQPENMVTWEAKKFGVSIQ^ 434 

: I I I II : I I : I I I I I I I I I : I : : I I I I : :: : :::::: II 



Db 



381 L YEAI YHGI PMI GI P L FGDQ P DN I AHMVAKGAAVS LN I RTMS KLD FL S ALVEVT DN P F YK 440 



Qy 435 SAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLG 494 

: I I I : I I I I I : : : : I I I I : I I I : : : I I I I I 

Db 441 KNVMLLSTIHHDQPMKPLDRAVFWIEFIMRHKGAKHLRPLGHNLPWYQYHSLDVIGFLLT 500 

Qy 495 LTLGTLWLCGK-LLGMAWWLRGARKVK 521 

I I I I I : : : : : I : I 
Db 501 CFAVIAALTVKCLLFMYRFFVKKEKKMK 528 



RESULT 5 
A42233 

glucuronosyltransf erase (EC 2.4.1.17) 2 - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 19-Jun-1992 #sequence_revision 18-Sep-1992 #text_change 09-Jul-2004 

C;Accession: A42233; A24324 

R;Mackenzie, P.I.; Rodbourn, L. 

J. Biol. Chem. 265, 11328-11332, 1990 

A; Title: Organization of the rat UDP-glucuronosyltransf erase, UDPGTr-2, gene and 
characterization of its promoter. 

A; Reference number: A42233; MUID: 90293083 ; PMID:2113533 
A;Accession: A42233 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-529 <MAC> 

A; Cross-references: UNIPROT : P09875; GB:J05482 

A; Note: the authors translated the codon GTA for residue 57 as B, and GTC for 
residue 116 as B 
R;Mackenzie, P.I. 

J. Biol. Chem. 261, 6119-6125, 1986 

A; Title: Rat liver UDP-glucuronosyltransf erase . Sequence and expression of a 

cDNA encoding a phenobarbital-inducible form. 

A; Reference number: A24324; MUID : 86196018 ; PMID: 3084479 

A;Accession: A24324 

A;Molecule type: mRNA 

A; Residues: 1-407 , 'V , 409-529 <MA2>' 

A; Cross-references: GB:M13506; NID:g207580; PIDN : AAA42313 . 1 ; PID:g207581 

A; Experimental source: liver 

C; Superf amily : glucuronosyltransf erase 

C;Keywords: glycosyltrans f erase; hexosyltransf erase; transmembrane protein 

Query Match 24.2%; Score 671; DB 2; Length 529; 

Best Local Similarity 32.2%; Pred. No. 1.6e-44; 

Matches 175; Conservative 90; Mismatches 228; Indels 50; Gaps 15; 

Qy 6 VXLLVGFLLPGVXLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

:: I : : I I "I : I I I I : : : : I I I I I : I : 

Db 11 LIQLICYFRPGA CGKVLVWPT-EYSHWINIKIILNELAQRGHEVTVLVSSASILIE 65 

Qy 66 DFKKEEKSYQVISW-LAPEDHQREFKKSFDFFLE— ETLG GRGKFENLLN VLE 115 

I : : : : : | | : I : I I I : Ml I : : I I : I 

Db 66 PTKESSINFEIYSVPLSKSDLEYSFAKWIDEWTRDFETLSIWTYYSKMQKVFNEYSDWE 125 

Qy 116 YLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG 171 

I I : | : | I : I I : :: : I I : I I I I I I I 
Db 126 NL CKALIWNKSLMKKLQGSQFDVI LADAVGPCGELLAELLKTPLVYSLRFCPGYRCE 182 



Qy . 172 SLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFL — MFFSFCRRQQHMQSTFDNTIKEHF 229 

111:111111 Ihlll I I I I I : : I I I : : : : 

Db 183 KFSGGLPLPPSWPVVLSELSDRMTF\^RVKNMLQMLYFDF WFQPFKEKSWSQFY 237 

Qy 230 TE — GSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFI 287 

: : I I : : : I I : : I ■ I : : : I I I I I : I I I I I I I I : I :: : I I : 
Db 238 SDVLGRPTTLTEMMGKADIWLIRTFWDLEFPHPFLPNFDBVGGLHCKPAKPLPREMEEFV 297 

Qy 288 AKFGDSGFVT,VTLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAAW 347 

I : I I : : I I I I I I : : I I : I I I : I : I I : I : 

Db 298 QSSGEHGVWFSLGSMVKNL-TEEKANWASALAQIPQKVVWRFD GKKPDTLGSNTR 353 

Qy 348 I VDWLPQSDLLAHPS I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFG 407 

: I : I I : I I I I I : I I I I I I I I I I I I : I : I I I I I I I I I : ! : : : I I 
Db 354 LYKWI PQNDLLGHPKTKAFVAHGGTNGI YEAI YHGI PIVGI PLFADQPDNINHMBAKGAA 413 

Qy 408 VSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGG 467 

I : I I : I : I I II I : II hi I I I I : : I : : I 

Db 414 VRVDFSILSTTGLLTALKIV1WDPSYKENAMRLSRIHHDQPVKPLDRAVFWIEYVMRHKG 473 

Qy 468 ATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGT LWLCGKLLGMAVWWLRGARK 519 

III:: I :: I I I I I : I I : I I I : I 

Db 474 AKHLRSTLHDLSWFQYHSLDVIGFLLLCWGWFIITKFCLFCCRKTANM GKK 526 

Qy . 520 VKE 522 

II 

Db 527 KKE 529 



RESULT 6 
B47113 

glucuronosyltransf erase (EC 2.4.1.17) UGT2B13 precursor - rabbit 
C; Species: Oryctolagus cuniculus (domestic rabbit) 

C;Date: 03-May-1994 #sequence_revision 03-May-1994 #text_change 09-Jul-2004 
C;Accession: B47113 

R;Tukey, R.H.; Pendurthi, U.R.; Nguyen, N.T.; Green, M.D.; Tephly, T.R. 
J. Biol.' Chem. 268, 15260-15266, 1993 

A; Title: Cloning and characterization of rabbit liver UDP- 

glucuronosyltransferase cDNAs . Developmental and inducible expression of 4- 
hydroxybiphenyl UGT2B13. 

A; Reference number: A47113; MUID: 93315511; PMID: 8325897 
A;Accession: B47113 
A; Status: preliminary 
A;Molecule type: mRNA 
A; Residues: 1-531 <TUK> 

A; Cross-references: UNIPROT : P36512 ; GB:L01081; NID:gl65796; PIDN : AAA18020 . 1 ; 
PID:gl65797 

C; Superf amily : glucuronosyltransf erase 

C; Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 

Query Match 24.2%; Score 670; DB 2; Length 531; 

Best Local Similarity 32.6%; Pred. No. 1.9e-44; 

Matches 169; Conservative 90; Mismatches 206; Indels 54; Gaps 14; 

Qy 34 SHYLLMDRVSQI LQDHGHNVTML NHKRG PFMPDFKKEEKSYQVISW 79 

I I : : I : I I I I I : I I : : I I : : I : I : I 



f 



Db 


35 


SHWMNMKTI LDALVQQGHEVTVLRS SAS I VI GSNNESGI KFETFHTS YRKDEI ENFFMDW 


94 


Qy 


80 


LAPEDHQREFKKSFDFFLE ETLGGRGKFENLLNVLEYLAL QCSHFLNRKDIMDS 


133 


Db 


95 


: 1 :: : 1 II : : | : | : | : I : I 
F YKMIYNVSIESYWETFS LTKMVILKYSDICEDICKEVILNKKLMTK 


141 


Qy 


134 


LKNENFDMVIVETFDYCPFLIAEKLGKP FVAILSTSFGSLEFGLPI PLSYVPV 

1 : 1 1 : 1 : : hill 1 II : 1 1 1 : 1 1 1 1 1 1 
LQESRFDWLADPVSPGGELLAELLKIPLVYSLRGFVGYMLQKHGG GLLLPPSYVPV 


186 


Db 


142 


198 


wy 


187 


FRSLLTDHMDFWGRVKNFL — MFFSFCRROOHMOSTFDNTIKEHFTEGSRPV-LSHLLLK 

II 1 1 1 1 : 1 1 : : 1 1 : : : 1 1 III 1 : 1 
MMSGLGSQMT FMERVQNLLCVLYFDFW-FPKFNEKRWDQFYSEVL GRPVT FLELMGK 


243 


Db 


199 


254 


'Qy 


244 


AELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSM 
1 : : 1 1 1 : : 1 1 1 1 1 1 1 : : 1 1 1 1 1 1 1 : 1 1 : : 1 : 1 : 1 : 1 1 : : 1 I 1 1 


303 


Db 


255 


ADMWLIRSYWDLEFPRPLLPNFDFIGGLHCKPAKPLPQEMEDFVQSSGEEGVWFSLGSM 


314 


Qy 


304 


WTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVTI 

:: 1 : : 1 1 1 1 1 1 : 1 : : 1 1 : 1 : : 1 : 1 1 : 1 1 1 1 1 
ISNL-TEERANVIASALAQLPQKVLWRFE GKKPDMLGSNTRLYKWI PQNDLLGHPKT 


363 


Db 


315 


370 


Qy 


364 


RLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRV^^ 

: 1 : 1 1 1 1 1 : I j I I 1 : I 1 1 1 : 1 1 1 1 1 1 : 1 : 1 : : 1 1 | : | | : : | 
KAFITHGGANGVFEAIYHGIPMVGLPLFGDQLDNIVYMKAKGAAVKLNLKTMSSADLLNA 


423 


Db 


371 


430 


Qy 


424 


MKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQ 

• 1 • • 1 It 1 : II 1:1 1 1 1 1 : : 1 : : Mil: 1 : : 

• i • » i ii i • ii i • i i i i i • • i • ■ ii ii« i • • 

LKTVINDPSYKENAMTLSRIHHDQPMKPLDRAVFWIEYVMRHKGAKHLRVAAHDLTWYQY 


483 


Db 


431 


490 


Qy 


484 


YLFDVFVFLLGLTLGTLWLCGKLLGW^VWWLRGARKVKE 522 

: II III 1 : 1 1 : :: 1 1 1 1 : 

HS LDVI GFLLACVAI TT YLI VKCCLLVYRYVLGAGKKKK 529 




Db 


491 





RESULT 7 
C47113 

glucuronosyltrans f erase (EC 2.4.1.17) UGT2B14 precursor - rabbit 
C; Species: Oryctolagus cuniculus (domestic rabbit) 

C;Date: 03-May-1994 #sequence_revision 03-May-1994 #text_change 09-Jul-2004 
C;Accession: C47113 

R;Tukey, R.H.; Pendurthi f U.R.; Nguyen, N.T.; Green, M.D.; Tephly, T.R. 
J. Biol. Chem. 268, 15260-15266, 1993 

A;Title: Cloning and characterization of rabbit liver UDP- 

glucuronosyltransferase cDNAs . Developmental and inducible expression of 4- 

hydroxybiphenyl UGT2B13. 

A; Reference number: A47113; MUID: 93315511; PMID: 8325897 
A/Accession: C47113 
A; Status: preliminary 
A; Molecule type: mRNA 
A; Residues: 1-530 <TUK> 

A;Cross-references: UNIPROT : P36513; GB:L01082; NID:gl65798; PIDN : AAA18021 . 1 ; 
PID:gl65799 

C; Superf amily : glucuronosyltransf erase 

C; Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 



Query Match 



23.9%; Score 662.5; DB 2; Length 530; 



Best Local Similarity 34.5%; Pred. No. 7.3e-44; 

Matches 161; Conservative 75; Mismatches 209; Indels 21; Gaps . 10; 



Qy 50 GHNVTMLNHKRGPFMPDFKKEEKSYQVISWLAPEDHQRE-FKKSFDFFLEETLGGRGKFE 108 

I I I : I : I : I : : : I : I : I : : I : 

Db 51 GHEVIVLRNSASIFIDPSKQANIKFETFPIAATKDDLEDLFVHYVSTWTNARQNSQWKYF 110 

Qy 109 NLLNVL— EY LALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFV 163 

: I I I II I : I : I I : I I : :: : I hill III 

Db 111 SLLQKLFSEYSDSCENACKEWFNKTLMTKLQESRFDILLSDAIGPCGELLAELLKIPFV 170 

Qy 164 AILSTSFG SLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFL — MFFSFCRRQQHM 217 

I : I I I : I ! I I ! : I I : I I II I I : : I I I 

Db 171 YSLRFTPGYTMEKYSGGLSVPPSYVPIILSDLSGKMTFMERVNNMLCMLYFDFW-FQMFN 229 

Qy 218 QSTFDNTIKEHFTEGSRPV-LSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPI 276 

: : I I III I h I I : : I I I : : I I I I I I : I I I I II 

Db 230 KKRWDQFYSEVL GRPVTFSELVGKADMWLIRSYWDLEFPRPTLPNIQFVGGLHCKPA 286 

Qy 277 KPVPQDLENFIAKFGDSGFVXVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHW 336 

I |: I : : : I h h I I : : I I I I h I : : I I I I I I I I h 
Db 287 KPLPKEMEEFVQSSGEEGVWFSLGSMVSN-MTEERANLIASAFAQLPQKVIWRFD G 342 

Qy 337 PKDVlilAANVKI VTWLPQSDLLAHPSIRL 396 

I I I : I I I : I I : I I I I I : I I I I I I I I I I I I hi I I I : I I I h I h 
Db 343 QKPETLGPNTRI YDWI PQNDLLGHPKTKAFVTHGGANGI YEAIHHGI PMVGLPLFGEQPD 402 

Qy 397 NMVT^VEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASV 456 

I : : I I : : | : : | I :| :: I II : I I hill 
Db 403 NIAHMTAKGAAI RLNWKTMSSEDLLNALKTVINDPSYKENVMTLSSIHHDQPMKPLDRAV 462 

Qy 457 GWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL 502 

I I : : I: : I I I I : I : : I I I h : : I 

Db 463 FWIEYVMRHKGAKHLRVAAHDLTWFQYHSLDWGFLVSCAAFLIFL 508 



RESULT 8 
A35343 

glucuronosyltransf erase (EC 2.4.1.17) - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 14-Sep-1990 #sequence_revision 14-Sep-1990 #text_change 09-Jul-2004 
C;Accession: A35343 

R;Sato, H. ; Koiwai, O. ; Tanabe, K. ; Kashiwamata, S. 

Biochem. Biophys . Res. Commun. 169, 260-264, 1990 

A; Title: Isolation and sequencing of rat liver bilirubin UDP- 

glucuronosyltransf erase cDNA: possible alternate splicing of a common primary 
transcript. 

A; Reference number: A35343; MUID : 90274676; PMID:2112380 
A;Accession: A35343 
A; Status : preliminary 
A; Molecule type: mRNA 
A; Residues: 1-531 <SAT> 

A; Cross-references: UNIPROT : P20720; GB:M34007; NID:g207578; PIDN : AAA42312 . 1; 
PID:g207579 

C; Superf amily : glucuronosyltransf erase 

C; Keywords: glycosyltransf erase; hexosyltransf erase ; transmembrane protein 



Query Match 23.8%; Score 658; DB 2; Length 531; 

Best Local Similarity 30.9%;.. Pred. No. 1.6e-43; 

Matches 171; Conservative 106; Mismatches 211; Indels 66; Gaps 17; 



Qy 1 MAGQRVXLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

: I III: II : I I : I : : I I I : I I I : I II : I 
Db 7 LRGLSGLLLLLCALP WAEGGKVX-VFPMEGSHWLSMRDWRELHARGHQAWL 58 

Qy 61 GPFMPDFKKEEKSYQVISWLAP EDHQREF KKSFD FFLEETLGGRGK 106 

I : | | : : : : | I :: I I I III: II II: I 

Db 59 APEVTVHMKGEDFFTLQTYAFPYTKEEYQREILGNAKKGFEPQHFVKTFF— ETMASIKK 116 

Qy 107 FENLLNVXEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAIL 166 

I : I | | : | : | : : | : : | | : | : : I I : I : I I I I 

Db 117 FFDL YAN S CAALLHN KT L I QQLN S S S FDWLT D P VFPCGALLAK YLQ I PAVFFL 170 

Qy 167 STSFGSLEF GLPI PLS YVPVFRSLLTDHMDFWGRVKNFLMFFS FCRRQQHMQST 220 

: : : : I I I I : I :: I : I I I I Mill : I ■ I : I 
Db 171 RSVPCGIDYEATQCPKPSSYIPNLLTMLSDHMTFLQRVKNMLYPLTLKYIC HLSIT 226 

Qy 221 FDNTIKEHFTEGSR PVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIK 277 

: : : I I I I I : I I I I I : I I : : I I I : : I I : I 

Db 227 PYESLASELLQREMSLVEVLSH ASVWLFRGDFVFDYPRPIMPNMVFIGGINCVIKK 282 

Qy 278 PVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWP 337 

I : I : I : : I : I I : H I I J I : : I : I : I I : : I : : 
Db 283 PLSQEFEAYWASGEHGIWFSLGSMVSEIPEKKAM-EIAEALGRIPQTLLWRYTGT 338 

Qy 338 KDVHLAANVlttvTDWLPQSDLLAHPSIRLF^ 397 

: : I I I : I I I I I : I I I II I I : I I I : I I I : I I I I I : I I I I I I : I 
Db 339 RPSNLAKNTILWWLPQNDLLGHPKARAFITHSGSHGIYEGICNGVPMV>^ 398 

Qy 398 MVT^VTSAKKFGVSIQLKKLKAETLALKMKQIMEDK^ 457 

I : I : I I : : : : : I : I : I : : : I I I : I : : I : I I 
Db 399 AKRMETRGAGVTLNVLEMTADDLENALKTVINNKSYKENIMRLSSLHKDRPIEPLDLAVF 458 

Qy 458 WIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL CGKLLGM 509 

I : : : I : : I I I I : I I : : : II I I I : T : : : II I 

Db 459 WvTSYVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLL^ 517 



Qy 510 AVWWLRGARKVKET 523 

I : I I :: 

Db 518 GKGRVKKS 525 



RESULT 9 „ 

157961 

glucuronosyltransf erase (EC 2.4.1.17) precursor - rat 
N;Alternate names: glucuronosyltransf erase 1 Bl; morphine UGT 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 12-Aug-1996 #sequence_revision 12-Aug-1996 #text_change 09-Jul-2004 
C;Accession: 157961; S51197; S68333 

R;Coffman, B.L.; Green, M.D.; King, CD. ; Tephly, T.R. 
Mol. Pharmacol. 47, 1101-1105, 1995 

A; Title: Cloning and stable expression of a cDNA encoding a rat liver UDP- 
glucuronosyltransf erase (UDP-glucuronosyltransf erase 1.1) that catalyzes the 
glucuronidation of opioids and bilirubin. 



A;Reference number: 157961; MUID : 95327065 ; PMID:7603447 
A/Accession: 157961 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-535 <RES> 

A; Cross-references: UNIPROT : Q64550; EMBL:U20551; NID:g695161; PIDN : AAC52219 . 1 ; 
PID:g695162 

R;Ishii, Y.; Tsuruda, K. ; Tanaka, M. ; Oguri, K. 
Arch. Biochem. Biophys . 315, 345-351, 1994 

A; Title: Purification of a phenobarbital-inducible morphine UDP- 

glucuronyltransf erase isoform, absent from gunn rat liver. 

A;Reference number: S51197; MUID: 95077409; PMID:7986077 

A;Accession: S51197 

A;Molecule type: protein 

A; Residues: 30-41 <ISH> 

R;Ikushiro, S.; Emi, Y.; Iyanagi, T. 

Arch. Biochem. Biophys. 324, 267-272, 1995 

A;Title: Identification and analysis of drug-responsive expression of UDP- 
glucuronosyltransferase family 1 (UGT1) isozyme in rat hepatic microsomes using 
anti-peptide antibodies . 

A; Reference number: S68333; MUID: 96132654 ; PMID: 8554318 

A;Accession: S68333 

A;Molecule type: protein 

A; Residues: 30-37 <IKU> 

C; Genetics : 

A; Gene: UGT1 . 1 

C; Superfamily : glucuronosyltransf erase 

C; Keywords : glycosyltransf erase; hexosyltransf erase 

Query Match 23.8%; Score 658; DB 2; Length 535; 

Best Local Similarity 31.4%; Pred. No. 1.7e-43; 

Matches 171; Conservative 97; Mismatches 215; Indels 62; Gaps 14; 

Qy 13 LLPGVLL SEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

I I I : I I I I I : I I : I I I : I I I I I I III:: I 
Db 13 LLPCLLLCVLGPSASHAGKLLVI-PIDGSHWLSMLGVIQQLQQKGHEVWI APEAS 67 

Qy 66 DFKKEEKS YQVI SWLAPEDHQREFKKS FDFFLEETLGGRGKFEN LLNVLE 115 

II | : : | : : I : I I I I : III:: 

Db 68 I HI KEGS FYTMRKYPVP FQNENVTAA FVEL GRSVFDQDPFLLRWKTYNKVKR 120 

Qy 116 YLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGS 172 

I | | | | : : | | | : : | | : : : | I : : I : I I I I : I 
Db 121 DSSMLLSGCSHLLHNAEFMASLEQSHFDALLTDPFLPCGSIVAQYLSLPAVYFLNALPCS 180 

Qy 173 LEF GLPI PLS YVPVFRSLLTDHMDFWGRVKNFLMFFS FCRRQQHMQ S T FDNT I K 226 

I : I I I I I I I I ' I I I : I I I I I : : : | | : | 

Db 181 LDLEATQCPAPLSYVPKSLSSNTDRMNFLQRVKNMIIALTENFLCRWYSPYGSLATEIL 240 

Qy 227 EHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENF 286 

: : II I : I : : I I I : I I : : I I I : : I I : I : I : I : 

Db 241 Q KEWWDLLSPASIWLMRNDFV1<DYPRPIMPNMVFIGGINCLQKKALSQEFEAY 295 



Qy 

Db 



287 IAKFGDSGFVXWLGSMVttTCQNPEIFKEMNNAFAHLPQG 346 

: I : I I : : I I I I I : : I : I : I I I : I : : : : I I I 
296 VNASGEHGI WFSLGSMVSEI PEKKAM-EIAEALGRI PQTVLWRYTGT RPSNLAKNT 351 



Qy 347 KIVDWLPQSDLLAHPSIRLFVTHGGQNSIM 406 

• : I I I I I : I I I I I I I : I I I : I I I : I I I I I : I I I I I I : I I - I = 
Db 352 ILVKWLPQNDLLGHPKARAFITHSGSHGIYEGICNGVPbWMMPLFGDQMDNAKRMETRGA 411 

Qy 407 GVS IQLKKLKAETLALKMKQIMEDKRYKSAAVAASVT LRSHPLS PTQRLVGWI DHVLQTG 466 

| | : : : : : I : I : I : : : I I I : I : : I : I I I : : : I : : 
Db 412 GVTLNVLEMTADDLENALKTVINNKSYKENIMRLSSLHKDRPIEPLDLAVFWVEYVMRHK 471 

Qy 467 GATHLKPYVFQQPWHEQYLFDVEVFLLGLTLGTLWL CGKLLGMAVWWLRGAR 518 

I I I I : I I : : : I I M I : I : : : III I 
Db 472 GAPHLRPAAHDLTWYQYHSLDVIGFLLAIVLTWFIVYKSCAYGCRKCFG GKG 524 

Qy 519 KVKET 523 

: I I :: 

Db 525 RVKKS 529 



RESULT 10 
JN0619 

glucuronosyltransf erase (EC 2.4.1.17) 2B-4 precursor - human 
N;Alternate names: UDP-glucuronosyltransf erase 2B-11 
C; Species: Homo sapiens (man) 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 09-Jul-2004 
C;Accession: JN0619; A27878 

R;Jin, C.J.; Miners, J.O.; Lillywhite, K.J.; Mackenzie, P.I. 
Biochem. Biophys . Res. Commun. 194, 496-503, 1993 

A;Title: cDNA cloning and expression of two new members of the human liver UDP- 
glucuronosyltransf erase 2B subfamily. 

A;Reference number: JN0619; MUID: 93326164 ; PMID:8333863 
A;Accession: JN0619 
A;Molecule type: mRNA 
A; Residues: 1-528 <JIN> 

A;Cross-references: UNIPROT : P06133; GB:AF081793; NID:g3426331 
A; Experimental source: liver 

R; Jackson, M.R.; McCarthy, L.R.; Harding, D. ; Wilson, S.; Coughtrie, M.W.H.; 
Burchell, B. 

Biochem. J. 242, 581-588, 1987 

A;Title: Cloning of a human liver microsomal UDP-glucuronosyltransf erase cDNA. 
A;Reference number: A27878; MUID : 87241362 ; PMID:3109396 
A;Accession: A27878 
A; Molecule type: mRNA 

A; Residues: 1-108, 1 F 1 , 110-170, » RP 1 , 173-381, 1 K 1 , 383-384 , 1 SPR 1 , 388-395, 'F', 397-528 
<JAC> 

A;Cross-references: GB:Y00317; NID:g37588; PIDN : CAA684 15 . 1 ; PID:g37589 
C; Genetics : 

A;Gene: GDB:UGT2B4; UGT2B11 

A; Cross-references: GDB: 5891331; OMIM: 600067 

A; Map position: 4ql3-4ql3 

C; Super f amily : glucuronosyltransf erase 

C; Keywords : glycoprotein; glycosyltransf erase; hexosyltransf erase; transmembrane 
protein 

F; 1-2 3 /Domain : signal sequence #status predicted <SIG> 

F;24-528/Product : glucuronosyltransf erase 2B-11 #status predicted <MET> 
F; 4 92-5 09/ Domain: transmembrane #status predicted <TMM> 
F;315/Binding site: carbohydrate (Asn) (covalent) #status predicted 



Query Match 



23.7%; Score 656.5; DB 2; Length 528; 



Best Local Similarity 33.9%; Pred. No. 2.1e-43; 

Matches 173; Conservative 85; Mismatches 213; Indels 39; Gaps 15; 

Qy 34 SHYLLMDRVSQI LQDHGHNVTMLNHKRG PFMPDFKKEEKSYQVI SWLAPEDHQREF 89 

II:::: I I I I I : I II II : : I I : : 

Db 34 SHWMNI KTI LDELVQRGHE VTVLAS SAS I S FDPNS PSTLKFEVYPVSLTKTEFEDI I KQL 93 

Qy 90 KKSFDFFLEETLGGRGKFENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDM 141 

I : I : I : I : : I I : : I : I I : I I : 

Db 94 VKRWAELPKDTFWSYLSQVQEIMWTFNDILRKF CKDIVSNKKLMKKLQESRFDV 147 

Qy 142 VIVET FDYCPF — LIAEKLGKPFVAILSTSFG-SLE FGLPIPLSYVPVFRSLLTDHM 195 

I: : II hill III I I I -I II I Mill I hi I 

Db 148 VLADA — VFPFGELI^VELLKIPFVYSLRFSPGYAIEKHSGGLLFPPSYVPVVMSELSDQM 205 

Qy 196 DFWGRVT<N--FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPvljSHLLLKAELWFINSDF 253 

I MM : : : : I I : M : I I I II : M : : I I : : 

Db 206 TFIERVKNMIYVLYFEFWFQIFDMKK-WDQFYSE — VLGRPTTLSETMAKADIWLIRNYW 262 

Qy 254 AFDFARPLLPNTVWGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV-NTCQNPEI 312 

II I I I II Mill M II M : : : I I : I : M I : M M I I I I : I 

Db 263 DFQFPHPLLPNVEFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLGSMVSNTSE — ER 320 

Qy 313 FKEMNNAFAHLPQGVIWKCQCSHWPKDVTiLAANVKIVT^ 372 

: : I I : II I : I : : I I I : : I M I M M I I I I M I I I 
Db 321 ANVI ASALAKI PQKVLWRFDGN KPDTLGLNTRLYKWIPQNDLLGHPKTRAFITHGGA 377 

Qy 373 NSIMEIAJQHGVPMVGIPLFGDQPENMVT^VEAKKFGVSIQLKKLKAETLALKMKQIME 432 

I I i I T I i : I I I I : I I I I I : I : : :\ I I I : : : I : I : : I 
Db 378 NGIYEAIYHGIPMVGVPLLADQPDNIAHMKAKGAAVSLDFHTMSSTDLLNALKTVINDPL 437 

Qy 433 YKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPWHEQYLFDVFVFL 492 

111:11 | : | | | | | : I : : I I I I : I : : I I I I 

Db 438 YKENAMKLSRIHHDQPWPLDRAVFWIEFVMRHKGAKHLRVAAHDLTWFQYHSLDVTGFL 497 

Qy 493 LGLTLGTLWLCGKLLGMAVW-WLRGARKVK 521 

I : : : I I I I : : I : I I 

Db 498 LACVATVI FI ITKCL-FCVWKFVRTGKKGK 526 



RESULT 11 
A35366 

glucuronosyltransf erase (EC 2.4.1.17) UDPGTh-2 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 17-Aug-1990 #sequence_revision 17-Aug-1990 #text_change 09-Jul-2004 
. C; Accession : A35366 

R;Ritter, J.K.; Sheen, Y.Y.; Owens, I.S. 
J. Biol. Chem. 265, 7900-7906, 1990 

A; Title: Cloning and expression of human liver UDP-glucuronosyltransf erase in 

COS-1 cells. 3,4-Catechol estrogens and estriol as primary substrates. 

A; Reference number: A35366; MUID: 90243659 ; PMID:2159463 

A; Accession: A35366 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-529 <RIT> 

A;Cross-references: UNIPROT : P16662 ; GB:J05428; NID:g340079; PIDN : AAA36793 . 1 ; 
PID:g340080 



C; Genetics : 

A; Gene: GDB : UGT2B7 ; UGT2B9 

A;Cross-references: GDB: 5892203; OMIM: 600218 

A; Map position: 4ql3-4ql3 

C; Super family: glucuronosyltransf erase 

C; Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 

Query Match 23.6%; Score 654.5; DB 2; Length 529; 

Best Local Similarity 31.5%; Pred. No. 3.1e-43; 

Matches 163; Conservative 88; Mismatches 212; Indels 55; Gaps 14; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRG-PFMPD FKKEEKSY Q 75 

II:::: I I I I I : I I I : I I : : : I 

Db 34" SHWMNI KTI LDELIQRGHEVTVLAS SAS I LFDPNNS SALKI EI YPTSLTKTELENFIMQQ 93 

Qy 76 VISWL-APEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ-CSHFLNRKDIMDS 133 

: | |:1 : | : : : : : : : : : | : : | I 

Db 94 IKRWSDLPKD TFWLYFSQV QEIMS I FGDITRKFCKDWSNKKFMKK 139 

Qy 134 LKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTS FGSLEFGLPI PLS YVPVFRS 189 

:: II::: I I : I I I I I I I I I I I I I I I I I 

Db 140 VQESRFDVIFADAIFPCSELLAELFNIPFVYSLSFSPGYTFEKHSGGFIFPPSYVPVVMS 199 

Qy 190 LLTDHMDFWGRVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTE GSRPVLSHLLLKA 244 

I I I I I I I I I : : : : I I II : I I I I " : I I 

Db 200 ELTDQMTFMERVKNMIYVLYFDF WFEI FDMKKWDQFYSEVLGRPTTLSETMGKA 253 

Qy 245 ELWFINSDFAFDFARPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV 304 

: : I I : : I I I I I I I : I I I I II I I : I : : : I : I : I : : I I : : I I I I I 
Db 254 DWLIRNSWNFQFPHPLLPim)FVGGLHCKPAKPLPKEMEDFVQSSGENGVWFSLGSMV 313 

Qy 305 NTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDW 364 

: | : : | | : M | : I : : I I I :: I : I h I I I II I 

Db 314 SN-MTEERANVIASALAQI PQKVLWRFDGN — —KPDTLGLNTRLYKWI PQNDLLGHPKTR 369 

Qy 365 LFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVT^VEAKKFGVS IQLKKLKAETLALKM 424 

I : I I I I I I I I I I I : I I I I I I I I I II : I : : : I : I : : : I : 
Db 370 AFITHGGANGIYEAIYHGIPMVGIPLFADQPDNIAHMKARGAAVRVDFNTMSSTDLLNAL 429 

Qy 425 KQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQY 484 

I : : : I I I : I I I : I I I I I : I : : I I I I : I : : 

Db 430 KRVINDPSYKENVMKLSRIQHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAAHDLTWFQYH 489 

Qy 485 LFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

I I I I I : : : , I I : I I I I = 

Db . 490 SLDVIGFLLVCVATVIFIVTKCCLFCFW— KFARKAKK 525 



RESULT 12 
S15089 

glucuronosyltransf erase (EC 2.4.1.17) - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 21-Nov-1993 #sequence_revision 10-Nov-1995 #text_change 20-Jun-2000 
C; Accession: S 150 8 9 

R;Lazard, D.; Zupko, K. ; Poria, Y. ; Nef, P.; Lazarovits, J.; Horn, S.; Khen, M. ; 
Lancet, D. 

Nature 349, 790-793, 1991 



A; Title: Odorant signal termination by olfactory UDP glucuronosyl transferase. 
A; Reference number: S15089; MUID: 91156050; PMID: 1900353 
A;Accession: S15089 
A; Molecule type: mRNA 
A; Residues: 1-527 <LAZ> 

A;Cross-references: GB:X57565; NID:g57762; PIDN : CAA40797 . 1 ; PID:g3980217 

C; Super f amily : glucuronosyl trans f erase 

C; Keywords : glycosyltransf erase; hexosyltransf erase 

Query Match 23.5%; Score 650; DB 2; Length 527; 

Best Local Similarity 31.2%; Pred. No. 6.9e-43; 

Matches 166; Conservative 87; Mismatches 205; Indels 74; Gaps 12; 

Qy 7 LLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP- 65 

I I : I I I : I I : I I I : I : : I I I I I : I I : . 

Db 12 LSLLGMSLGGNVL IWPMEGSHWLNVKIIIDELLRKEHNVTVLVASGALFITP 63 

Qy 66 DFKKEE KSYQVISWL APEDHQREFKKSFDFFLEETLGG 103 

III: | : | : : | | : I : I : II 

Db 64 SVSPSLTFEIYPVPFGKEKIESVIKDF-VLTWLENRPSPSTIWTFYKEMAKVIEEFHLVS 122 

Qy 104 RGKFENLL^A^LEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFV 163 

II I | : : | | : | : : : : : I : : I I I I I I : 

Db 123 RG ICDGVLKNEKLMTKLQRGKFEVLLSDPVFPCGDIVALKLGIPFI 168 

Qy 164 AILSTSFGSLEFG LPI PLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCR 212 

III : I I I I I I I I I I I I I I : I I : : I 
Db 169 Y SLRFSPASTVEKHCGKVPFPPSYVPAILSELTDQMSFADRVRNFISY R 217 

Qy 213 RQQHMQ S T FDNT I KEH FT E — GSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGG 270 

|:|| : : : : | | : I I I : I : : : I : I I I I I I : I I I 

Db 218 MQDYMFETLWKQWDSYYSKALGRPTTLCETMGKAEIWLMRTYWDFEFPRPYLPNFEFVGG 277 

Qy 271 LMEKPI KPVPQDLENFIAKFGDSGFVXVTLGSMVNTCQNPEI FKEMNNAFAHLPQGVIWK 330 

I | | | | : | : : : | | : I : I I : : I I I I I I : : I I : I I I : I : 

Db 278 LHCKPAKPLPKEMEEFVQTSGEHGWVFSLGSMVT<NL-TEEKA^ 336 

Qy 331 CQCSHWPKDV71LAANvT<IVTWLPQSDLLAHPSIRLFW 390 

: I : I : : I I : I I : I I I I I I I : I I I I I I I I I I I : I I I I : I :' 

Db 337 YK GKI PATLGSNTRLFDWI PQNDLLGHPKTRAFITHGGTNGI YEAI YHGI PMVGVPM 393 

Qy 391 FGDQPENM\mVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLS 450 

I I I I : I : : : I I I : : : : I : : : : : I I I : II I : 

Db 394 FADQ P DN I AHMKAKGAAVT5 VNMNTMT S ADLLS AVRAVI NE P FYKENAMRL S RI HHDQ P VK 453 

Qy 451 PTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL 502 

I I I I I : I : : I I I I : I : : I I I I I : I 

Db 454 P L D RAVFW I E FVMRH KGAKH L RVAAH DL S WFQ YH S L DVI G FLLACMAS AI LL 505 



RESULT 13 
A48633 

glucuronosyltransf erase (EC 2.4.1.17) precursor - human 
N; Alternate names: dihydrotestosterone/androstanediol UDP- 

glucuronosyltransferase isoform 3, udpgth-3; UDP glucuronosyltransf erase 2 

family, protein B15 

C; Species: Homo sapiens (man) 



C;Date: 07-Apr-1994 #sequence_revision 18-Nov-1994 #text_change 09-Jul-2004 
C;Accession: A48633; 138559 

R;Chen, F. ; Ritter, J.K.; Wang, M.G.; McBride, O.W.; Lubet, R. A. ; Owens, I.S. 
Biochemistry 32, 10648-10657, 1993 

A; Title: Characterization of a cloned human dihydrotestosterone/androstanediol 
UDP-glucuronosyltransferase and its comparison to other steroid i so forms . 
A; Reference number: A48633; MUID: 94002056; PMID: 8399210 
A;Accession: A48633 

A; Status: preliminary; not compared with conceptual translation 

A; Molecule type: nucleic acid 

A; Residues: 1-530 <CHE> 

A; Cross-references : UNIPROT : P54855 

A; Experimental source : liver 

A;Note: sequence extracted from NCBI backbone (NCBIP : 138786) 
R;Green, M.D.; Oturu, E.M.; Tephly, T.R. 
Drug Metab. Dispos. 22, 799-805, 1994 

A;Title: Stable expression of a human liver UDP-glucuronosyltransferase 
(UGT2B15) with activity toward steroid and xenobiotic substrates. 
A; Reference number: 138559; MUID : 951368 67 ; PMID: 7835232 
A;Accession: 138559 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-530 <RES> 

A; Cross-references: EMBL:U08854; NID:g475758; PIDN : AAC50077 . 1 ; PID:g475759 
C; Genetics : 

A; Gene: GDB : UGT2B15 ; UGT2B8 

A;Cross-references: GDB: 5892418; OMIM: 600219 

A; Map position: 4ql3-4ql3 

C; Superf amily : glucuronosyltransf erase 

C; Keywords : glycosyltransf erase; hexosyltransf erase 

Query Match 23.4%; Score 649; DB 2; Length 530; 

Best Local Similarity 31.8%; Pred. No. 8.3e-43; 

Matches 155; Conservative 90; Mismatches 218; Indels 24; Gaps 10; 

IHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVI-SWLAPEDHQREFKKS 92 

I : : I : : I I I I I : I : I : I : I I : I 

IHWINMKTILEELVQRGHEVTVLTSSASTLVNASKSSAIKLEVYPTSLTKNDLEDSLLKI 93 

'DFFLEETLGGRGKFENLLNVLEYLALQ CSHFLNRKDIMDSLKNENFDMVTV 144 

I : : : | : : | : | : I : | : | | : I I : : : 



I I : I I II: I I I : I I I I I I II I hi I 



II : : : I I : : : : I I I I : I I I : I I : : I : I 



Qy 


34 


Db 


34 


Qy 


93 


Db 


94 


Qy 


145 


Db 


152 


Qy 


201 


Db 


212 


Qy 


259 


Db 


269 


Qy 


319 



I I I I I : I I I I | | I I : | : : : I I : I : : I I : : I I I 



I I : I I I : I : I I : I : : I I I I : I I I I I : I : I I I I I I I I 



Db 328 ALAQI PQKVLWRFD GKKPNTLGSNTRLYKWLPQNDLLGHPKTKAFITHGGTNGIYEA ■ 384 

Qy 379 IQHGVPIWGIPLFGDQPENM\mVEAKKFGVSIQLKKLKAETI^KMKQIMEDKRYKSAA 438 

I I I : I I I I I I I I I I : I : ::|| :|: :: : : I :| :: I II : 
Db 385 I YHGI PMVGI PLFADQHDNIAHMKAKGAALSVDI RTMSSRDLLNALKSVINDPVYKENVM 444 

Qy 439 AASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLG 498 

II I : I I I I I : I : : I I I I : I : : II III 

Db 445 KLSRIHHDQPMKPLDRAVFWIEFVMRHKGAKHLRVAAHNLTWIQYHSLDVIAFLLACVAT 504 

Qy . 499 TLWLCGK 505 

• • • i 

Db 505 VIFIITK 511 



RESULT 14 
JN0620 

UDP-glucuronosyltransferase (EC 2.4.1.-) 2B-10 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 09-Jul-2004 
C; Accession: JN0620 

R;Jin, C.J.; Miners, J.O.; Lillywhite, K.J.; Mackenzie, P.I. 
Biochem. Biophys . Res. Commun. 194, 496-503, 1993 

A; Title: cDNA cloning and expression of two new members of the human liver UDP- 
glucuronosyltransferase 2B subfamily. 

A; Reference number: JN0619; MUID : 93326164 ; PMID: 8333863 ' 
A;Accession: JN0620 
A;Molecule type: mRNA 
A; Residues: 1-528 <JIN> 

A;Cross-references: UNIPROT : P36537 ; GB:X63359; NID:g516149; PIDN : CAA44961 . 1 ; 
PID:g516150 

A; Experimental source: liver 

C; Superf amily : glucuronosyltransf erase 

C; Keywords : glycoprotein; glycosyltransf erase; hexosyltransf erase; transmembrane 
protein 

F; 1-2 3/ Domain : signal sequence ffstatus predicted <SIG> 

F;24-528/Product: UDP-glucuronosyltransferase 2B-10 ttstatus predicted <MET> 
F; 491-508/Domain: transmembrane #status predicted <TMM> 

F; 66, 314, 481/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 23.2%; Score 642; DB 2; Length 528; 

Best Local Similarity 31.6%; Pred. No. 2.9e-42; 

Matches 167; Conservative 82; Mismatches 213; Indels 66; Gaps 13; 

Qy 31 VGGSHYLLMDRVSQILQD HGHNVTMLNHKRGPFMPDFKKEEKSYQVISWLAPEDHQR 87 

I : I I : I I : t~ I I I I : I I : : II 

Db 27 VWAAE Y S LWMNMKT I L KE LVQ RGH EVT VL ASSASIL— FDPNDSST 70 

Qy 88 EFKKS FDFFLEETLGGRGKFENLLNVL EYLALQ 120 

: : I : I : I I I : : I : : 

Db 71 LKLEVYPTSLTKT EFENI IMQLVKRLSEIQKDTFWLP FSQEQEI LWAINDI I RNF 125 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG-SLE FG 176 

I : : I : I I : I I : I : : I I : I I III I I I I I I 
Db 126 CKDWSNKKLMKKLQESRFDIVFADAYLPCGELLAELFNIPFVYSHSFSPGYSFERHSGG 185 



Qy 



177 LPIPLSYVPVFRSLLTDHMDFWGRVKN— FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSR 234 



I I I I I I 11:111 I I I I : : : : I I : : I : : I I I 

Db 186 FI FP P S YVP WMS KLS DQMT FMERVKNMLYVL Y FD FW FQ I FNMKK- WDQ FY S E — VLGRP 242 

Qy 235 PVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSG 294 

II : I I : : I : : : I I I I I I : I I I I I I I I : I : : : I I : I : : I 
Db 243 TTLSETMRKADIWLMFU^SWNFKFPHPFLPNVDFVGGLHCKPAKPLPKEMEEFVQSSGENG 302 

Qy 295 FVXVTLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSH^ 354 

I : : I I I I I : I : I I . : I I I : I : : I I . I : : I : I I 
Db 303 VWFSLGSMVSN-MTEERANVIATALAKIPQKVLWRFDGN KPDALGLNTRLYKWIPQ 358 

Qy 355 SDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMV^ 414 

: I I I I I I I : I I I I I I I I I I I : I I I I I I I I I I I : I : : : I I I : 
Db 359 NDLLGHPKTRAFITHGGANGIYEAIYHGIPMVGIPLFFDQPDNIAHMKAKGAAVRVDFNT 418 

Qy 415 LKAETLALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP Y 474 

: : I : | : : | | I : I I I : I I I I I : I : : I I I I : 

Db 419 MSSTDLLNTU^KTVINDPSYKENIMKLSRIQHDQPVKPLDRAVFWIEFVMRHKGAKHLRVA 478 

Qy 475 VFQQPWHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

I : : II III I : : I I : I I I I : 

Db 479 AHNLTWFQYHSLDVIGFLLACVATVLFIITKCCLFCFW — KFARKGKK 524 



RESULT 15 
S07390 

glucuronosyltransf erase (EC 2.4.1.17) 3 precursor - rat 

N; Alternate names: 17beta-hydroxysteroid UDP-glucuronosyltransf erase; UDP- 

glucuronosyltransf erase isoform 50K 

C; Species: Rattus norvegicus (Norway rat) 

C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change 07-May-1999 
C;Accession: S07390; A33236; A28460; S59626 

R;Harding, D.; Wilson, S.M.; Jackson, M.R.; Burchell, B. ; Green, M.D.; Tephly, 
T.R. 

Nucleic Acids Res. 15, 3936, 1987 

A; Title: Nucleotide and deduced amino acid sequence of rat liver 17beta — 

hydroxys teroid UDP-glucuronosyltransf erase. 

A; Reference number: S07390; MUID : 87231096; PMID: 3108864 

A;Accession: S07390 

A; Molecule type: mRNA 

A; Residues: 1-530 <HAR> 

A; Cross-references : EMBL: Y00156 

A; Experimental source: liver 

A; Accession: A33236 

A; Molecule type: protein 

A; Residues: 24-61 <HAR2> 

A; Experimental source: liver 
R;Mackenzie, P.I. 

J. Biol. Chem. 262, 9744-9749, 1987 

A; Title: Rat liver UDP-glucuronosyltransf erase . Identification of cDNAs encoding 
two enzymes which glucuronidate testosterone, dihydrotestosterone, and beta- 
estradiol. 

A;Reference number: A28460; MUID: 87250645; PMID:3110162 
A; Accession: A28460 
A; Molecule type: mRNA 

A; Residues: 1-118, 1 G 1 , 120-240, '1/ ,242-423, 'S' ,425-499, 'T' ,501-530 <MAC> 
A; Experimental source: liver 



R;Yamashita, A.; Watanabe, M. ; Tonegawa, T.; Sugiura, T.; Waku, K. 
Biochem. J. 312, 301-308, . 1995 

A; Title: Acyl-CoA binding and acylation of UDP-glucuronosyltransf erase isoforms 

of rat liver: their effect on enzyme activity. 

A; Reference number: S59626; MUID : 96077159; PMID: 7492328 

A;Accession: S59626 

A;Molecule type: protein 

A; Residues: 24-44 <YAM> 

A; Experimental source: liver 

C; Superf amily : glucuronosyltransf erase 

C; Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 

F; 1-2 3 /Domain: signal sequence #status predicted <SIG> 

F; 2 4-530/ Product : glucuronosyltransf erase #status experimental <MAT> 

F; 494-510/Domain: transmembrane #status predicted <TMM> 

Query Match 23.1%; Score 640; DB 2; Length 530; 

Best Local Similarity 34.2%; Pred. No. 4.2e-42; 

Matches 167; Conservative 81; Mismatches 189; Indels 52; Gaps 15; 
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Search completed: February 15, 2005, 12:55:30 



Job time : 61 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



February 15, 2005, 07:50:18 ; Search time 95 Seconds 

(without alignments) 
1798.840 Million cell updates/sec 

US-10-017-867A-282 
2768 

1 MAGQRVLLLVGFLLPGVLLS GKLLGMAVWWLRGARKVKET 523 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



1376875 seqs, 326749119 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



1376875 



Post -processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 1500 summaries 



Database : Published_Applications_AA: * 

1 : /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB.pep: * 

2 : /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB .pep : * 

3 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB.pep: * 

4 : /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB.pep: * 

5 : /cgn2_6/ptodata/l/pubpaa/US07JNTEW_PUB.pep: * 

6 : /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBC0MB .pep : * 

7 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB,pep: * 

8 : /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep: * 

9 : /cgn2_6/ptodata/l/pubpaa/US09A__PUBCOMB.pep: * 
10 : /cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep: * 
11 : /cgn2_6/ptodata/l/pubpaa/US0 9C_PUBCOMB.pep:* 
12 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep: * 
13 : /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep:* 
14 : /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep: * 
15: / cgn2_6 /p t oda t a / 1 /pubpaa /US 1 0C_PUBCOMB . pep : * 
16 : /cgn2_6/ptodata/l/pubpaa/US10D_PUBCOMB.pep: * 
17 : /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep: * 
18 : /cgn2_6/ptodata/l/pubpaa/USll_NEW_PUB.pep:* 
19 : /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB.pep: * 
20 : /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep: * 

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result Query- 
No. Score Match Length DB ID 



Description 
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US- 


10 


-437 


Q £ O 

- y bo 


1 C O A 1 

- Ibbo U 1 


C? o/^ri i on z*" 1 o 


ljtDoUl , 


1011 


125 


4 


. 5 


152 


16 


US- 


10 


-767 


- /Ul 


r 1 HOI 

-bi /yi 


Co m i on a*o 
OcquciiLc 


CI 7Q1 A 
01/51, A 


1012 


125 


4 


. 5 


182 


15 


US- 


10 


-425 


1 1 i4 

-114 


-o by4 u 


Qo An i on a*o 
ocquciiL tr 


O (TO/ A A 

O D y 4 U , A 


1013 


125 


4 


. 5 


258 


15 


US- 


10 


-424 


CQQ 

-599 


O /I A A A A 

-240009 


O jzi i^ri i on o 


7 A A A A Q 

Z 4 u u u y , 


1014 


125 


4 


. 5 


369 


14 


US- 


10 


-9R9 

£j O Z? 


- lbb 


O A /Z 

-o 06 


C o/^ri i on f* o 

o cquciicc 


0 A d Ann 

jud, "PP 


1015 


125 


4 


. 5 


527 


15 


IIR- 


X V 


-49 R 

T 61 J 


ii/i 
- 114 


-63888 


O o nri i o n ^"i o 


£ 0 Q Q Q A 
DJOOO, A 


1016 


124 . 5 


4 


. 5 


475 


15 


T J^- 


X \J 


-494 
^ *± 


CQQ 

-byy 


-267504 


O o /^n l o n /~i o 

sequence 


0 £T 1 C A/1 

z b / b U4 ; 


1017 


124 . 5 


4 


. 5 


483 


16 


us_ 


10 


-417 


Q £ 7 

- y bo 


1 1 A A C Q 

- 1 1 U4 b y 


C? o /""yi i o n f» o 

sequence 


1 1 

1 1 U4 b y , 


1018 


124 . 5 


4 


. 5 


4 9R 

*T Z* ~J 


1 R 

X _} 




X u 


-4 94 


-599 


-255704 


Sequence 


O C C *"7 A A 

Zoo I U4 , 


1019 


124 


4 


. 5 


198 


15 




1 0 

X \J 


-49R 


ii/i 

-114 


A O IT O f 

-48536 


oequciice 


4 o bo b , A 


1020 


123 . 5 


4 . 


. 5 


158 


15 




x vy 


-49 R 

r± o «J 


-114 


-52503 


ocquence 


COCAO A 


1021 


123 . 5 


4 . 


. 5 


466 


15 


TJR- 


1 0 

X VJ 


-494 


r— n o 

-byy 


-170054 


C o f~f\ i o n c^l 

ocquencc 


i n a a c: a 
1 / U Ub4 , 


1022 


123 . 5 


4 . 


, 5 


475 


16 


US- 


10 


-437 


- 9oo 


1 A O C 1 "J 

-108517 


Qoati i on a*o 


1 A ft R 1 7 
1UO Jl / , 


1023 


123 . 5 


4 . 


. 5 


503 


16 


US- 


X w 


i o / 


-963 


-136453 


oequence 


1 O CZ A CO 

lo b4 bo , 


1024 


123 


4 . 


, 4 


209 


16 


I J9- 


1 0 

X w 


-417 


-963 


-156797 


C o/^ri i on o 


ibb / y / , 


1025 


123 


4 . 


4 


71 Q 
X Z> 


1 R 

X O 


T TO _ 


X u 


-4 94 


CQQ 

-oyy 


o a q o c o 
-z boobo 


Sequence 


O C Q O C O 

zboo bo , 


1026 


123 


4 , 


, 4 


222 


1 R 




X u 


-4 94 


-599 


-226907 


Sequence 


O O £T Q A 1 

zz b y U / , 


1027 


123 


4 . 


, 4 


464 


X VJ 




X VJ 


-417 
*± o / 


-963 


-199656 


ocquence 


i y y bbb , 


1028 


123 


4 . 


4 


467 


1 R 

X -J 


T TO _ 
UO 


X u 


-97R ■ 
- Z / o 


-782 


-8 


Sequence 


o , Appl 1 


1029 


123 


4 


4 


^± o z* 


X D 


tto_ 


x u 




-963 


-126692 


Sequence 


Izbb yz , 


1030 


123 


4 . 


4 


481 


13 


US- 


10- 


- 078 • 


-929 


-76 


Q O ATI 1 on r~* o 


7fi Arm! 


1031 


122.5 


4 . 


4 


106 


15 


us- 


10- 


-424- 


-599 


-253520 


Sequence 


253520, 


1032 


122 . 5 


4 . 


4 


143 


15 


us- 


10- 


-424« 


-599 


-227255 


Sequence 


227255, 


1033 


122.5 


4 . 


4 


455 


14 


us- 


10- 


-338- 


-777 


-51 


Sequence 


51, Appl 


1034 


122.5 


4 . 


4 


465 


15 


us- 


10- 


-425- 


-114- 


-63136 


Sequence 


63136, A 


1035 


122.5 


4 . 


4 


477 


16 


us- 


10- 


-437- 


-963 


-170646 


Sequence 


170646, 


1036 


122.5 


4 . 


4 


493 


16 


us- 


10- 


-437- 


-963- 


-170486 


Sequence 


170486, 



1037 


122 


4 


4 


3 8 0 


1 0 




09 
vj y 


-7ft9 
/ D y 


-714 


-9ft 
— ZD 


C i d th a 
OcLJU.CllL.C 


9ft Arvr-Nl 


1038 


122 


4 


. 4 


380 


15 


us - 


10 


-107 

X \J 1 


-411 

1 J X 


-991 

£t £t X 


C P»r^n i on o o 

OCLj UCilL. C 


9 91 Bnn 

Z Z X , -"PP 


1039 


122 


4 


. 4 


473 


15 


US- 


10 


-424 


- R99 
~j y y 


-1 791ftft 


C p>pr-i -i on P o 


1 791ftft 

X / Z j O D , 


1040 


122 


4 


, 4 


1046 


16 


us - 


10 


-417 


- 9ftl 


- 1 91 14 9 


C Q<^T1 1 ^T"l f~* Q 

OCv^UlCllC c 


1 91 *J A 9 
x Z x j Z , 


1041 


121.5 


4 


. 4 


157 


16 


US- 


10 


-437 


-9ftl 


- 1 044 Oft 


C om i on o o 

O Cv^UCllL c 


1 044 Oft 


1042 


121 . 5 


4 


, 4 


161 


16 


US- 


10 


-767 


-701 


-RR91ft 


c om i on p o 


RR91ft Z\ 


1043 


121 . 5 


4 


, 4 


481 


16 


US- 


10 


-437 


-963 


-152752 


^ pm i pTi p p 

O Cv^UCl lv_ v3 


1 R97R9 


1044 


121. 5 


4 


. 4 


485 


16 


us- 


10 


-437 


-963 


-145149 


SpmiPiipp 


14514 9 , 


1045 


121 


4 . 


. 4 


153 


16 


US- 


10 


-767 


-701 


-40501 


C om i on pp 


40R01 A 


1046 


121 


4 


. 4 


161 


16 


us- 


10 


-437 


-963 


-122198 


Sequence 


122198, 


1047 


121 


4 . 


. 4 


205 


15 


us- 


10 


-424 


-599 


-229443 


Sequence 


229443, 


1048 


120 . 5 


4 


. 4 


177 


16 


us- 


10 


-437 


-963 


-156799 


Sequence 


156799, 


1049 


120 . 5 


4 


. 4 


311 


16 


us- 


10 


-437 


-963 


-185160 


Sequence 


185160, 


1050 


120 . 5 


4 


. 4 


4 91 


11 

X J 


us- 


10 


-07ft 


-929 


-72 


Sequence 


72, Appl 


1051 


119 . 5 


4 


. 3 


270 


16 


us- 


10 


-417 


-963 


-129189 


Sequence 


129189, 


1052 


119 


4 


. 3 


94 


9 


US-09- 


99 R - 


299- 


1261 


Sequence 12 61, Ap 


1053 


119 


4 


. 3 


94 


10 


T TC - 
Uo 


0 9 
U y 


-99R 


-299 


-1261 


Sequence 


X z b 1 , Ap 


1054 


118 . 5 


4 


. 3 


486 


16 


T TQ 

Uo - 


i n 

J- u 


-048 


-866D-12 


Sequence 


Xz , AppX 


1055 


118 


4 


. 3 


182 


15 


T TC 


i n 


-424 


-599 


-204962 


Sequence 


O A A A O 

2 04962, 


1056 


118 


4 


. 3 


507 


16 




i n 
1 u 


-417 


-963 


-125744 


Sequence 


X25744 , 


1057 


117 . 5 


4 


. 2 


142 


15 


T TC 

Ub - 


i r\ 


-424 


-599 


-235141 


Sequence 


235141, 


1058 


117 


4 


. 2 


435 


15 


T TC 

Uo - 


1 o 
X u 


-49R 

£i J 


-114 


-48716 


Sequence 


A O "7 1 £T 7V 

4 o /Xb , A 


1059 


117 


4 


. 2 


455 


15 


T TC 
Uo - 


X u 


-424 


-599 


-164669 


Sequence 


X64 669 , 


1060 


117 


4 


. 2 


4 ft 1 

1 O -L 


1 R 

X ZJ 


T TC 

Uo - 


1 u 


U J 


-319A-23 


Sequence 


23 , AppX 


1061 


116.5 


4 


2 


ft 09 

D \J Z> 


1 ft 

X o 


T TC 

Uo - 


1 U 


-417 


-963 


-195456 


Sequence 


195456 , 


1062 


lift 

x x u 


4 


2 


4 ft 1 


1 ft 

X D 


T TC 

Uo - 


t a 


-417 
— Hl 3 1 


-963 


-109952 


Sequence 


T A A A IT O 

109952 , 


1063 


116 


4 


. 2 


I/O 


1 ft 
X o 


T TC 

Uo - 


1 U 


-417 


-963 


-194859 


Sequence 


1 Q A o cr A 

1948 59, 


1064 


116 


4 


. 2 


4 88 


16 


T TC 

Ub - 


T A 

X U 


-417 


-963 


-132460 


Sequence 


132460 , 


1065 


115.5 


4 


. 2 


J U 1 


1 R 

X J 


T TO 

Ub - 


1 A 

X u 


-4 94 


-599 


-176275 


Sequence 


176275 , 


1066 


114 . 5 


4 


. 1 


4 94 


16 


T TO 

Ub - 


1 A 


-437 


-963 


-129713 


Sequence 


12 9713 , 


1067 


114 


4 


. 1 


134 


15 


T TO 

Ub - 


10 


-424 


-599 


-264963 


Sequence 


264963 , 


1068 


113 . 5 


4 


. 1 


112 


15 


T TO 

Ub - 


xu 


-424 


-599 


-201975 


Sequence 


OAT f\T(T 

201975 , 


1069 


113 . 5 


4 


. 1 


469 


15 


T TO 

Ub - 


1 A 

10 


-425 


-114 


-70563 


Sequence 


70563, A 


1070 


113 . 5 


4 


, 1 


505 


1 R 

X -J 


T TO 

Ub - 


1 A 
10 


-49R 


-114 


-67480 


Sequence 


/~ A C\ r\ TV 

67480, A 


1071 


113 . 5 


4 


, i 


706 


1 ft 

X o 


T TC 

Ub - 


i n 
1 U 


-417 


-963 


-175099 


Sequence 


T H C A Q A 

x /buy 9 , 


1072 


113 


4 


, i 


225 


1 ft 

X o 


us- 


10 


-417 


-963 


-194133 


Sequence 


194133, 


1073 


113 


4 


t i 


4 72 


1 R 

X J) 


us- 


10 


-4 94 

1 Z. r± 


-599 


-238530 


Sequence 


238530, 


1074 


112 


4 


. 0 


419 


1 ft 

X o 


us- 


10 


-417 


-963 


-126672 


Sequence 


126672, 


1075 


111 


4 


. 0 


124 


1 ft 

X D 


us- 


10 


-7ft7 


-701 


-35499 


Sequence 


35499, A 


1076 


111 


4 


. 0 


475 


15 


us- 


10 


-424 


-599 


-249784 


Sequence 


249784, 


1077 


110 . 5 


4 


. 0 


180 


1R 

X J 


us- 


10 


-424 


-599 


-265054 


Sequence 


265054, 


1078 


110 . 5 


4 


. 0 


283 


16 


us- 


10 


-417 


-963 


-190628 


Sequence 


190628, 


1079 


110 


4 . 


. 0 


146 


16 


us- 


10 


-767 


-701 


-61438 


Sequence 


61438, A 


1080 


110 


4 


0 


x / y 


1 R 

X J 


us- 


10 


-4 94 


-599 


-223670 


Sequence 


223670, 


1081 


109 

X \J J 


-j 


Q 
• y 


4 71 

*x / X 


1 R 

X J 


us- 


10 


-494 


-599 


-224402 


Sequence 


224402, 


1082 


109 




q 


4 7 R 


1 R 

X 3 


us- 


10 


-49R 

*i Z D 


-114 


-58078 


Sequence 


58078, A 


1 Oft 7 


1 09 
x \j y 




q 
. y 


ft 91 


1 ft 
x O 


us- 


10 




-963 


-164331 


Sequence 


164331, 


1 0ft4 

X \J O T 


10ft R 




q 
• y 


4 77 


1 R 
X D 


us- 


10 


-4 94 
- ^± Z 


-599 


-145625 


Sequence 


145625, 


1 ORR 

X \J O ~J 


i rift r 




q 

. y 


ZJ Z / 


1 ft 
X D 


us- 


10 


-4 77 


-963 


-112814 


Sequence 


112814, 


1086 


108.5 




q 
. y 


1232 


1 R 

X z> 


us- 


10 


- 1 ft 9 

J D -7 


-493 


-21495 


Sequence 


21495, A 


1087 


108 


3 . 


, 9 


3079 


15 


us- 


10 


-369 


-493 


-2024 


Sequence 


2024, Ap 


1088 


107.5 


3 . 


. 9 


179 


16 


us- 


10 


-437 


-963 


-175086 


Sequence 


175086, 


1089 


107 


3. 


. 9 


98 


15 


us- 


10 


-424 


-599 


-172741 


Sequence 


172741, 


1090 


107 


3 . 


. 9 


427 


15 


us- 


10 


-282 


-122A-54637 


Sequence 


54637, A 


1091 


106.5 


3 . 


.8 


102 


15 


us- 


10 


-424 


-599 


-241946 


Sequence 


241946, 


1092 


106.5 


3 . 


.8 


389 


11 


us- 


09 


-758 


-759 


-9 


Sequence 


9, Appli 


1093 


106.5 


3 . 


.8 


389 


15 


US- 


10 


-107 


-431 


-223 


Sequence 


223, App 



1094 


105 - 5 


3 


. 8 


118 


16 


US 


-10 


-437 


-963 


-169987 




1 6QQft7 
x o y zj o / t 


1095 


105 . 5 


3 


. 8 


216 


16 


US 


X \J 


-767 


-701 

/ VJ X 


- 7 Rft 99 


Q o iTi i on f~* o 

ocyuciiLc 


J DOZZ , A 


1096 


105 . 5 


3 


. 8 


890 


16 


us 


-10 


-477 


-967 

^ O O 


- 1 S7 ft 1 7 


OCLjUcnCc 


1 D ZS O ± / , 


1097 


105 


3 


. 8 


132 


16 


us 


X \J 


-4 77 


-96 7 

-7 D J 


- 1 1 ft 9 1 4 


SeQusncs 


X 1 o z 14 , 


1098 


105 


3 


. 8 


363 


16 


us 


-10 


-437 


-967 

ZJ O J 


- 1 9 697 6 

l^i DZ J D 




1 9 £ O 7 C 
XZ OZj D , 


1099 


105 


3 


. 8 


408 


9 


US- 


09- 


789 - 


261- 


2 




2 , App 1 i 


1100 


105 


3 


. 8 


408 


15 


us 


-10 


-4 m 

M \J X 


-971 

/ X 




O a *^ri i a y\ 


2 , Appl i 


1101 


105 


3 


. 8 


494 


16 


us 


-10 


-437 


-963 


- 1 7674 9 


O i on c* o 
OCL^Ufc;IlL.c 


1 7 £ 7 4 9 


1102 


104 . 5 


3 


. 8 


449 


16 


us 


-10 


-477 


-967 

.7 O O 


-14 ft 79ft 
x *± o / z, o 


OcLJutrnce 


X4 O / Zo , 


1103 


104 . 5 


3 


. 8 


460 


15 


us 


-10 


-424 


- S99 

-J ZJ ZJ 


-94 R 996 


O o/^n i on f* o 
O C U Cll C C 


z 4 D y y b , 


1104 


104 


3 


. 8 


969 


9 


US- 


09- 


981- 


353 - 


122 


O o fTi ipnro 


1 9 9 Ann 
L Z Z , /\pp 


1105 


104 


3 


. 8 


977 


9 


US- 


09- 


925- 


297- 


797 


CpmipriPP ' 


7Q7 7\nn 


1106 


103 . 5 


3 


. 7 


113 


16 


us 


-10 


-767 


-701 


-42027 


Qom ipnpp 

DC(J[UCilL C 


4 9 n 9 7 a 
ft Z U Z / , 


1107 


103 . 5 


3 


. 7 


183 


16 


us 


-10 


-767 


-701 


-7 7967 


Q om i on o 

o L2 U. 1 1 L. c: 


7 7 9 67 Zi 


1108 


103 . 5 


3 


. 7 


479 


16 


us 


-10 


-437 


-963 


-9 04 97R 


O o rr\ idtipq 

ocLlU.trllt-.tr 


9 OA Q 1 D 


1109 


103 


3 


. 7 


967 


14 


uo 


-in 

X w 




-ft9 7 


-79 


OCLJUCllCc 


JZ, AppX 


1110 


103 


3 


, 7 


967 


14 


us 


-10 


-264 


-374 


-2 01 


Q q /-T-i -l on o o 
OCLj^UcllLtr 


901 Zirvn 

z u x , App 


1111 


103 


3 


, 7 


967 


14 


us 


-10 


-281 


-904 


-2 


0 o m i on f~* o 
OCmJcllLtr 


9 2\ nn "I -i 
Z , Hppi X 


1112 


103 


3 


. 7 


967 


15 


us 


-10 


-n99 

\j zj zj 


-322 


-74 


O o <ti i on o 
OCLJUtrllCC 


'7/1 7\« n 1 
/ft , AppX 


1113 


103 


3 


. 7 


967 


15 




x yj 


- n44 


- R64 


-74 


C o /^ri i on /"■» q 

oequencc 


/4 , AppX 


1114 


103 


3 


. 7 


967 


15 


us 


-10 


-264 


-374 


-201 


O o m 1 on r* 1 o 
OCLjUcllLC 


9ni 7\T-,n 
Z \J ± t App 


1115 


102 . 5 


3 


. 7 


143 


16 


us 


-10 


-437 


-967 


-1 79077 


O o iti i on r** o 
OCLjUtrllLC 


n o o A77 
1j z VzS I, 


1116 


102 . 5 


3 


. 7 


622 


9 


US- 


09- 


981- 


353 - 


12 9 




1 9 9 Arm 


1117 


102 . 5 


3 


. 7 


816 


16 


us 


-10 


-437 


-963 
j \j ~> 


-198301 


O o m i on r* • o 
OCm-icllLtr 


1 9 ft 7 m 


1118 


101.5 


3 


. 7 


114 


16 


VJO 


-in 

X \J 


/ D / 


-7m 

/Ul 


- R7 nft4 


OCLjUCllCc 


jjUo4 , A 


1119 


101.5 


3 


, 7 


174 


1 R 


\J o 


X \J 


-494 

r± Z *± 


_ R9 9 

D zj zj 


-949ft99 
Z ft Z 0 Z Z 


ocCjUcilCc 


Z4z ozz , 


1120 


101.5 


3 


. 7 


385 


1 0 


uo 


_ 0 Q 


-ft 99 


-97 R 
Z / D 


- 6 


ocLjUcllCc 


6 ; Appl i 


1121 


101 


3 


. 6 


340 


16 


T TO 

\~ftzj 


-1 0 

X u 


-4 7 7 

*± O / 


-963 


-189996 


ocLJUcilCc 


xo y y y b , 


1122 


100.5 


3 


. 6 


603 


11 


I 

u o 


vj zj 


-979 


-211 


-102 


ocLjllcllCc 


1 rtn Ann 

X u z , ^pp 


1123 


100 . 5 


3 


. 6 


603 


15 


us 


-10 


-096 


-625 


-102 


O o m i on o 
OCL|U.CllLC 


1UZ , -"PP 


1124 


100.5 


3 


. 6 


842 


14 


us 


X \J 


- nft4 

\J o 


-839 


-2774 


O o i on /— 1 q 
OCLjUCllL C 


9 7 7 A An 
Z / / ft , Ap 


1125 


100 


3 


. 6 


63 


16 


us 


-10 


-437 


-963 


-114703 


O om i on r* 1 o 
OCmxcllLC 


1 1 4 70 7 
X x ft / UJ, 


1126 


100 


3 


. 6 


108 


16 


us 


-10 


-767 


-701 


-58390 




Rft79n 2\ 

JO J 7U / i-i 


1127 


100 


3 . 


. 6 


354 


15 


us 


-10 


-282 


-122A-52804 


O om i pnpp 
OCLJU.C11LC 


c? o H4 a 


1128 


100 


3 


. 6 


388 


15 


us 


-10 


-627 


-476 


-180 


O o fTi i on <— • o 
O C Ll CI 1 L C 


1 ft n Ann 
lou , .rtpp 


1129 


100 


3 . 


. 6 


455 


15 


us 


-10 


-799 

Zj £j ZJ 


-148A-17 


O o m i on r~« o 
OCLjUtrllLc 


1 7 Ann! 
X / , AppX 


1130 


100 


3 . 


. 6 
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19 98 An 


1240 


91 


3 


.3 


366 


15 


us 


-10 


-425 


-114-44445 


Spmipnrp 


44445, A 


1241 


91 


3 


. 3 


599 


15 


us 


-10 


-424 


-599-266797 


J UCi iv_ 


ti O D / -7 / / 


1242 


91 


3 


. 3 


600 


15 


us 


-10 


-425 


-114-55322 






1243 


91 


3 


.3 


686 


15 


us 


-10 


-282 


-122A-74764 


Spmipnrp 


74764 A 


1244 


91 


3 


. 3 


1397 


17 


us 


-10 


-473 


-451-6 


^pffllpnrp 


6 Annl "i 


1245 


91 


3 , 


. 3 


1826 


15 


us 


-10 


-289 


-762-113 


Spmipnrp 


117 Ann 

x x _j / r\^J^J 


1246 


90 . 5 


3 . 


. 3 


99 


16 


us 


-10 


-767 


-701-55235 


CT pipri n piT-i pi pi 


c; c 9 0 c a 


1247 


90 . 5 


3 . 


. 3 


364 


15 


us 


-10 


-369 


-493-11180 


pi pf-i -l pri pi Q 


1 1 1 ft n a 

X X X O U , JTl 


1248 


90 . 5 


3 . 


. 3 


381 


15 


us 


-10 


-369 


-493-1334 


Q pi pri -i piT-i r* P> 


1 7 74 An 


1249 


90 . 5 


3 . 


. 3 


602 


10 


us 


-09 


-953 


-067A-7 


Cprniprirp 

i— > C ^ 1 v_ c. 


7 Annl i 

/ / r\^j£j X X 


1250 


90.5 


3 . 


.3 


602 


15 


us 


-10 


-260 


-937-51 


CpmipfirP 

k-> ^ UCI 1L v3 


SI Annl 


1251 


90.5 


3 . 


. 3 


2111 


16 


us 


-10 


-437 


-963-112998 


CprfTiprirp 
i_J ^ UClHw c 


1 1999ft 

HZ J JO / 


1252 


90.5 


3 . 


. 3 


2783 


15 


us 


-10 


-369 


-493-6344 




674 4 An 


1253 


90 


3 . 


3 


106 


16 


us 


-10 


-767 


-701-55687 


Sequence 


55687, A 


1254 


90 


3 . 


3 


304 


15 


us 


-10 


-424 


-599-183691 


Sequence 


183691, 


1255 


90 


3 . 


3 


357 


15 


us 


-10 


-282 


-122A-61260 


Sequence 


61260, A 


1256 


90 


3 . 


3 


357 


15 


us 


-10 


-282 


- 1 9 ?A- 7 f) R71 


Sequence 


70571, A 


1257 


90 


3 . 


3 


511 


16 


us 


-10 


-686 


-947-166 


Sequence 


166, App 


1258 


90 


3 . 


3 


511 


16 


us 


-10 


-686 


-947-168 


Sequence 


168, App 


1259 


90 


3 . 


3 


620 


9 


US-i 


39-1 


361- 


451A-34 


Sequence 34, Appl 


1260 


90 


3 . 


3 


838 


14 


us- 


-10 


-084 


-839-2789 


Sequence 


2789, Ap 


1261 


90 


3 . 


3 


1953 


15 


us 


-10- 


-369 


-493-1945 


Sequence 


1945, Ap 


1262 


89.5 


3 . 


2 


265 


16 


us 


-10- 


-767 


-701-44289 


Sequence 


44289, A 


1263 


89.5 


3 . 


2 


302 


15 


us- 


-10- 


-424 


-599-212485 


Sequence 


212485, 


1264 


89.5 


3 . 


2 


422 


15 


us- 


-10- 


-335 


-977-7596 


Sequence 


7596, Ap 



1265 


89.5 


3 


.2 


422 


15 


US -10-335-977-7597 




7597 A-n 

/ D J f , -rip 


1266 


89.5 


3 


.2 


426 


15 


US -10-335-977-7598 


OCtJUCUL C 


/ D j O , ±\y 


1267 


89.5 


3 


.2 


596 


15 


US -10-112- 94 4-4 04 




A OA A -r-N-pl 

*± \J *± / r\^J^J 


1268 


89 . 5 


3 


. 2 


665 


15 


US-10-108-260A-3396 


UCHL tr 


119^ in 


1269 


89 . 5 


3 


. 2 


930 


9 


US -09-815-749-1 14 81 




1 *3 A Q 1 A 


1270 


89.5 


3 


.2 


930 


15 


US- 10 -282 -122A-74109 




74 1 0 Q A 


1271 


89.5 


3 


.2 


1403 


16 


US -10-475-711-6 


Q lpripp 


o , A.pp± 1 


1272 


89.5 


3 


.2 


1420 


16 


US-10-475-711-8 




8 Ann 1 i 


1273 


89 . 5 


3 


.2 


1428 


16 


US -10-475-711-2 


O ^/-ji i on o 
o cm»lc J It, fc; 


z , App 1 1 


1274 


89.5 


3 


.2 


1445 


16 


US -10-475-711-4 


C cirri i oti r 1 *^ 


4 Zim^l t 
*± t £\\jy±. X 


1275 


89 


3 


.2 


678 


15 


US -10-282-12 2A-69719 


CT prri 1 Dn PD 


oy/±y t A 


1276 


89 


3 


.2 


866 


14 


US-10-205-219-1 




1 m^i 1 "i 
J- , in,pp X X 


1277 


88.5 


3 


. 2 


113 


15 


US -10-424-599-219817 




9 1 9ft 1 7 
X Zj O X / , 


1278 


88 . 5 


3 


. 2 


198 


15 


US -10-424 -599-250468 




Z D Urt D 0 , 


1279 


88 . 5 


3 


. 2 


616 


16 


US -10-417-961-1 1 5210 


o tr L| u. tr 1 1 C cr 


1 T ROTH 


1280 


88.5 


3 


. 2 


620 


16 


US -10-408-76 5A -2886 


iD cv^iacriHw tr 


9 ft ft £ A-m 


1281 


88 . 5 


3 


.2 


722 


15 


US- 10 -282 -122A-71737 


Qppri i p-npn 

LDcr^Licin_tr 


71 717 A 


1282 


88.5 


3 


.2 


726 


10 


US-09-932-257A-19 


Sequence 


19, Appl 


1283 


88.5 


3 


.2 


726 


14 


US-10-151-274-6 


Sequence 


6, Appli 


1284 


88 . 5 


3 


. 2 


1561 


15 


US -10-369-493-1131 


Sequence 


1131, Ap 


1285 


88 


3 


. 2 


374 


15 


US -10-369-493 -13548 


Sequence 


13548, A 


1286 


88 


3 


.2 


388 


14 


US-10-138-701-57 


Sequence 


57, Appl 


1287 


88 


3 


.2 


430 


15 


US-10-084-84 6A-62 


Sequence 


62, Appl 


1288 


88 


3 


. 2 


500 


9 


US -09- 73 1-8 72 -466 


Sequence 4 66, App 


1289 


88 


3 


. 2 


500 


10 


US - 09-876-997-466 


Sequence 


466, App 


1290 


88 


3 


.2 


638 


15 


US -10-425-1 14 -70558 


Sequence 


70558, A 


1291 


88 


3 


.2 


708 


15 


US- 10 -282 -122A-684 62 


Sequence 


68462, A 


1292 


88 


3 


.2 


838 


10 


US-09-864 -636A-66 


Sequence 


66, Appl 


1293 


88 


3 


. 2 


838 


10 


US -09-758-289-961 


Sequence 


261, App 


1294 


88 


3 


. 2 


838 


11 


US-09-864 -42 6A- 66 


Sequence 


66, Appl 


1295 


88 


3 


. 2 


838 


14 


US -10-084-839-66 


Sequence 


66, Appl 


1296 


88 


3 


.2 


979 


15 


US -10-360- 52 2-58 


Sequence 


58, Appl 


1297 


88 


3 


.2 


1153 


9 


US -09-9 63 -137-202 


Sequence 2 02, App 


1298 


88 


3 


. 2 


1153 


10 


US- 09- 962 -8 54A-3 


Sequence 


3 , Appl i 


1299 


88 


3 


. 2 


1153 


10 


US-09-963-131-202 


Sequence 


202, App 


1300 


88 


3 


. 2 


1153 


11 


US-09-397-967-14 


Sequence 


14, Appl 


1301 


88 


3 


. 2 


1948 


16 


US -10-437-961-1 178 66 


Sequence 


137866, 


1302 


88 


3 


. 2 


19723 


15 


US- 10- 084 -846A-5 


Sequence 


5, Appli 


1303 


87 . 5 


3 


. 2 


187 


15 


US-10-495-1 14-18959 


Sequence 


38952, A 


1304 


87 . 5 


3 . 


. 2 


254 


15 


US -10-494-599-960101 


Sequence 


260301, 


1305 


87 . 5 


3 . 


. 2 


305 


15 


US- 10-2 89 -199A-74991 


Sequence 


74291, A 


1306 


87 . 5 


3 . 


.2 


359 


14 


US -10- 012 -8 19-164 


Sequence 


164, App 


1307 


87.5 


3 . 


.2 


401 


15 


US -10-282- 122A-52890 


Sequence 


52890, A 


1308 


87 . 5 


3 . 


2 


432 


9 


US-09-8 81-759A-1 9R 


Sequence 198, App 


1309 


87 . 5 


3 . 


2 


447 


9 


US -09-815-242- 13 490 


Sequence 13490, A 


1310 


87 . 5 


3 . 


.2 


447 


9 


US -09-815-242-13612 


Sequence 13612, A 


1311 


87 . 5 


3 . 


2 


447 


15 


U^-l 0-989-1 99A-74098 


Sequence 


74098, A 


1312 


87 . 5 


3 . 


2 


472 


15 


US- 10 -282 -199A-54597 


Sequence 


54597, A 


1313 


87 . 5 


3 . 


2 


575 


15 


US- 10-104 -04 7-1699 


Sequence 


3622, Ap 


1314 


87 . 5 


3 . 


2 


578 


15 


U^-l 0-989-1 99A-441 l 1 ^ 


Sequence 


44135, A 


1315 


87. 5 


3 . 


2 


623 


15 


US -10-2 96 -115-1190 


Sequence 


1190, Ap 


1316 


87 . 5 


3 . 


2 


824 


16 


US- 10-437 -963 -19465 0 


Sequence 


194650, 


1317 


87.5 


3 . 


2 


859 


16 


US-10-437-963-186692 


Sequence 


186692, 


1318 


87. 5 


3. 


2 


896 


16 


US-10-437-963-180822 


Sequence 


180822, 


1319 


87.5 


3 . 


2 


1452 


16 


US-10-408-765A-83 


Sequence 


83, Appl 


1320 


87 


3 . 


1 


69 


9 


US-09-864-761-42081 


Sequence 42081, A 


1321 


87 


3 . 


1 


92 


15 


US- 10-424 -59 9 -18 8790 


Sequence 


188790, 



1322 




87 


3 


. l 


356 


14 


US-10-161-398-6 




Z\ nn 1 H 


1323 




87 


3 


. 1 


356 


14 


US-10-161-398-7 


JCL[UC11LC 


7 Arvrvl H 

/ , .tt.L/L)x X 


1324 




87 


3 


. 1 


356 


15 


US- 10-424 -599 -152601 




1 qOfiHl 


1325 




87 


3 


. 1 


365 


15 


US- 10-425- 114 -36617 




JODl / , J\ 


1326 




87 


3 


. l 


365 


15 


1 19-1 0 - 4 2 S - 1 14-^4709 

UJ ±,\J *± £i J X X *± D *± / \J J 


Sequence 


54709, A 


1327 




87 


3 


. 1 


488 


10 


US-09-882-227-42 


Sequence 


42, Appl 


1328 




87 


3 


. l 


709 


15 


US -10-424-S99-269771 


Sequence 


269773, 


1329 




87 


3 


. 1 


735 


16 


US -10-417-961-1 6 ?1 4A 


Sequence 


162148, 


1330 




87 


3 


. l 


960 


16 


US -10-417-961-1 07Dfi c i 


Sequence 


107065, 


1331 




87 


3 


. 1 


1160 


15 


US- 10-335-977 -6116 


Sequence 


6136, Ap 


1332 




87 


3 


. 1 


1167 


9 


US -09-815-242-11522 


Sequence 11522, A 


1333 




87 


3 


. 1 


1855 


14 


US -10-177-291-11 5 


Sequence 


315, App 


1334 


86 


. 5 


3 


. 1 


331 


15 


US -10-424-599-21 0 6 R R 


Sequence 210688, 


1335 


86 


. 5 


3 


. 1 


352 


9 


US -09-815-242-13614 


Sequence 13634, A 


1336 


86 


. 5 


3 


. 1 


359 


15 


US- 10 -282-1 2 2A-S1 RSI 


Sequence 


51853, A 


1337 


86 


. 5 


3 


. 1 


409 


15 


US -10-11S-977-R277 


Sequence 


8277, Ap 


1338 


86 


. 5 


3 


. 1 


412 


15 


US -10-335-977-R27R 


Sequence 


8278, Ap 


1339 


86 


. 5 


3 


. 1 


430 


15 


US -10-424-S99-277R0R 


Sequence 


277805, 


1340 


86 


. 5 


3 


. 1 


443 


15 


US -10-424-^99-147919 


Sequence 


147939, 


1341 


86 


. 5 


3 


. l 


498 


10 


US - 0 9 - 7 6 6 - S 1 1 R - 2 6 


Sequence 


26, Appl 


1342 


86 


. 5 


3 


. 1 


498 


14 


US -10-189- 12 3-51 

WW JL \J X. \J . X. -J X. 


Sequence 


51, Appl 


1343 


86 


. 5 


3 


. 1 


498 


14 


US-10-188-495-51 


Sequence 


51, Appl 


1344 


86 


. 5 


3 


, 1 


498 


15 


US-10-424-233-7 


Sequence 


7, Appli 


1345 


86 


. 5 


3 . 


. 1 


542 


15 


US -10-289-762-496 


Sequence 


496, App 


134 6 


86 


. 5 


3 . 


. 1 


591 


10 


US -09-766-51 IB -25 


Sequence 


25, Appl 


1347 


86 


. 5 


3 . 


. 1 


591 


14 


US -10-189- 12 3-50 


Sequence 


50, Appl 


1348 


86 


. 5 


3 . 


. 1 


591 


14 


US -10-188-495-50 


Sequence 


50, Appl 


1349 


86 


. 5 


3 . 


. 1 


591 


15 


US -10-424-233 -6 


Sequence 


6, Appli 


1350 


86 


.5 


3 . 


. 1 


597 


15 


US-10-424-233-9 


Sequence 


9, Appli 


1351 


86 


. 5 


3 . 


. 1 


613 


15 


US-10-369-493-12123 


Sequence 


12123, A 


1352 


86 


. 5 


3 . 


. 1 


622 


9 


US-09-764-881-119 


Sequence 119, App 


1353 


86 


.5 


3 . 


. 1 


622 


10 


US-09-764-881-119 


Sequence 


119, App 


1354 


86 


. 5 


3 . 


.1 


622 


10 


US-09-766-511B-23 


Sequence 


23, Appl 


1363 


86 


.5 


3 . 


. 1 


622 


14 


US-10-116-252-13 


Sequence 


13, Appl 


1423 


86 


.5 


3 . 


. 1 


622 


14 


US-10-189-123-48 


Sequence 


48, Appl 



Search completed: February 15, 2 005, 12:53:30 
Job time : 102 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: February 15, 2005, 12:33:38 ; Search time 122 Seconds 

(without alignments) 
2195.225 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-017-867A-282 
2768 

1 MAGQRVLLLVGFLLPGVLLS. 



. GKLLGMAVWWLRGARKVKET 523 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1612378 seqs, 512079187 residues 

Total number of hits satisfying chosen parameters: 1612378 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 1500 summaries 

Database : UniProtJD3:* 

1: uniprot_sprot : * 
2: uniprot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


2768 


100.0 


523 


2 


Q6UXC4 


Q6uxc4 


homo sapien 


2 


2754 


99.5 


523 


2 


Q8NBP2 


Q8nbp2 


homo sapien 


3 


2198 


79.4 


523 


2 


Q6NUS8 


Q6nus8 


homo sapien 


4 


2193 


79.2 


..523 


2 


Q96DM6 


. Q96dm6 


homo sapien. 


5 


1815 


65.6 


523 


2 


Q8R0Y5 


Q8r0y5 


mus musculu 


6 


1804 


65.2 


523 


2 


Q8JZZ0 


Q8jzz0 


mus musculu 


7 


1802 


65.1 


523 


2 


Q8VC11 


Q8vcll 


m hypotheti 


8 


1145 


41.4 


523 


2 


Q63ZR6 


Q63zr6 


xenopus lae 


9 


963 


34.8 


221 


2 


Q8NAW4 


Q8naw4 


homo sapien 


10 


914.5 


33.0 


252 


2 


Q8IYS9 


Q8iys9 


homo sapien 


11 


792 


28.6 


302 


2 


Q8BRY7 


Q8bry7 


mus musculu 


12 


748.5 


27.0 


502 


1 


UDC1 RABIT 


P36514 


oryctolagus 


13 


727 


26.3 


541 


2 


Q98TB5 


Q98tb5 


gallus gall 


14 


714.5 


25.8 


527 


2 


Q6UWM9 


Q6uwm9 


homo sapien 


15 


699 


25.3 


541 


1 


CGT_HUMAN 


Q16880 


homo sapien 



16 


698 


25. 


2 


541 


2 


Q91W57 


17 


697 


25. 


2 


541 


1 


CGT_MOUSE 


18 


689.5 


24. 


9 


529 


2 


Q9TSL6 


19 


685.5 


24. 


8 


529 


2 


097951 


20 


685.5 


24. 


8 


534 


2 


Q8BWQ1 


21 


684.5 


24. 


7 


498 


2 


Q9BDZ8 


22 


684 


24. 


7 


541 


1 


CGT RAT 


23 


681 


24. 


6 


530 


2 


Q9R110 


24 


680.5 


24. 


6 


534 


2 


Q9D811 


25 


680 


24. 


6 


529 


2 


Q9GLD9 


26 


679 


24. 


5 


533 


1 


UD11_HUMAN 


27 


677.5 


24. 


5 


530 


1 


UDB2 RAT 


28 


677.5 


24. 


5 


530 


2 


Q6K1J1 


29 


677.5 


24. 


5 


534 


2 


Q8R129 


30 


676.5 


24. 


4 


533 


2 


Q6DHD1 


31 


675.5 


24. 


4 


528 


2 


Q6PDD0 


32 


674.5 


24. 


4 


530 


2 


Q7TT85 


33 


673.5 


24. 


3 


533 


2 


046548 


34^ 


673 


24. 


3 


530 


1 


UDBKJ4ACFA 


35 


672.5 


24. 


3 


449 


2 


Q9H6S4 


36 


670.5 


24. 


2 


528 


2 


Q80X89 


37 


670 


24 . 


2 


527 


2 


Q9ESE4 


38 


670 


24. 


2 


531 


1 


UDBD_RABIT 


39 


669 


24. 


2 


531 


2 


Q6T5F0 


40 


669 


24. 


2 


535 


1 


UD11 MOUSE 


41 


669 


24. 


2 


535 


2 


Q6XL50 


42 


668 


24. 


1 


529 


1 


UDB1 RAT 


43 


668 


24. 


1 


529 


2 


Q8R084 


44 


667.5 


24. 


1 


529 


2 


Q8VIF8 


45 


667 


24 . 


1 


533 


2 


046549 


46 


666.5 


24. 


1 


528 


2 


Q8WN97 


47 


665. 5 


24. 


0 


533 


2 


046423 


48 


665 


24. 


0 


531 


1 


UD15_RAT 


49 


665 


24. 


0 


533 


2 


Q9TSL7 


50 


664 . 5 


24. 


0 


528 


1 


UDBJ MAC FA 


51 


664 


24. 


0 • 


531 


2 


Q8VD45 


52 


663.5 


• 24. 


0 


529 


1 


UDB9 MAC FA 


53 


662.5 


23. 


9 


530 


1 


UDBE RABIT 


54 


662 


23. 


9 


533 


2 


Q95KM4 


55 


661.5 


23. 


9 


528 


1 


UDB4_HUMAN 


56 


661 


23. 


9 


528 


2 


Q8VIF9 


57 


661 


23. 


9 


533 


2 


Q6T5F2 


58 


660.5 


23. 


9 


529 


2 


Q9GLE0 


59 


658 


23. 


8 


533 


1 


UD12 RAT 


60 


658 


23. 


8 


533 


2 


Q9BDG7 


61 


658 


23. 


8 


535 


...1 . 


UD11_RAT 


62 


657 


23. 


7 


530 


1 


UDBH_HUMAN 


63 


654.5 


23. 


6 


529 


1 


UDB7_HUMAN 


64 


654.5 


23. 


6 


529 


2 


Q6XL4 8 


65 


651.5 


23. 


5 


525 


2 


Q76K67 


66 


651 


23. 


5 


527 


1 


UDA1_HUMAN 


67 


651 


23. 


5 


529 


2 


Q6DJ94 


68 


650.5 


23. 
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Q91w57 mus musculu 
Q64676 mus musculu 
Q9tsl6 macaca fasc 
097951 macaca fasc 
Q8bwql mus musculu 
Q9bdz8 bos taurus 
Q09426 rattus norv 
Q9rll0 cavia porce 
Q9d811 mus musculu 
Q9gld9 macaca mula 
P22309 homo sapien 
P08541 rattus norv 
Q6kljl canis famil 
Q8rl29 mus musculu 
Q6dhdl brachydanio 
Q6pdd0 mus musculu 
Q7tt85 rattus norv 

046548 felis silve 
077649 macaca fasc 
Q9h6s4 homo sapien 
Q80x89 mus musculu 
Q9ese4 mus musculu 
P36512 oryctolagus 
Q6t5f0 rattus norv 
Q6388 6 mus musculu 
Q6xl50 mus musculu 
P09875 rattus norv 
Q8r084 mus musculu 
Q8vif8 cavia porce 

046549 felis silve 
Q8wn97 macaca fasc 
046423 felis silve 
Q64638 rattus norv 
Q9tsl7 macaca fasc 
Q9xt55 macaca fasc 
Q8 vd4 5 rattus norv 
002663 macaca fasc 
P36513 oryctolagus 
Q95km4 macaca mula 
P06133 homo sapien 
Q8vif9 cavia porce 
Q6t5f2 rattus norv 
Q9gle0 macaca mula 
P20720 rattus norv 
Q9bdg7 macaca mula 
Q64550 rattus norv 
075795 homo sapien 
P16662 homo sapien 
Q6xl48 mus musculu 
Q76k67 mus musculu 
Q9y4xl homo sapien 
Q6dj 94 xenopus tro 
Q8kl54 mus musculu 
P36510 rattus norv 
019103 oryctolagus 
P54855 homo sapien 
Q6xl43 mus musculu 
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Q6t5fl rattus norv 
Q8bjl9 mus musculu 
P36537 homo sapien 
Q64637 rattus norv 
P19488 rattus norv 
P70691 mus musculu 
P19224 homo sapien 
P17717 mus musculu 
Q91wh2 m riken cdn 
Q8wuq4 homo sapien 
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Q9by64 homo sapien 
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Q68gl9 rattus norv 
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P08542 rattus norv 
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Q68g32 rattus norv 
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Q64634 rattus norv 
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P22310 homo sapien 
Q7sxe7 brachydanio 
P08430 rattus norv 
Q28611 oryctolagus 
Q75xxl pleuronecte 
Q62789 rattus norv 
Q9w710 pleuronecte 
Q9w711 pleuronecte 
Q7qc46 anopheles g 
Q7ptf6 anopheles g 
Q7q5t0 anopheles g 
Q7q5s9 anopheles g 
Q91280 pleuronecte 
Q7qir0 anopheles g 
Q6dfm6 xenopus tro 
P70624 rattus norv 
Q7z6h8 homo sapien 
Q7qgj3 anopheles g 
Q9vj81 drosophila 
096832 drosophila 
Q9vgt0 drosophila 
Q9w2j4 drosophila 
Q9xyn3 drosophila 
Q9vji0 drosophila 
Q7q3k0 anopheles g 
Q9vda5 drosophila 
Q965x5 caenorhabdi 
Q7pt89 anopheles g 
Q7q6n4 anopheles g 
Q63662 rattus norv 
Q9vgt5 drosophila 
Q9u3q6 caenorhabdi 
Q9vgs9 drosophila 
Q9xyn4 drosophila 
Q17399 caenorhabdi 
Q9vmgl drosophila 
Q8syl7 drosophila 
Q8szd9 drosophila 
Q9w2j3 drosophila 
Q7q4h4 anopheles g 
Q9vjh8 drosophila 
Q9vj46 drosophila 
001614 caenorhabdi 
Q9vgt3 drosophila 
016276 caenorhabdi 
Q9vj45 drosophila 
Q9vgt4 drosophila 
Q7qjn2 anopheles g 
Q9vj47 drosophila 
Q9vgt8 drosophila 
Q7pin9 anopheles g 
Q9txz4 caenorhabdi 
Q23323 caenorhabdi 
Q9vjh9 drosophila 
Q21603 caenorhabdi 
Q9vgt2 drosophila 
Q17813 caenorhabdi 
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262 


332.5 


12. 


0 


523 


2 


Q9J8A8 


Q9j8a8 


spodoptera 


263 


331.5 


12. 


0 


528 


1 


UDPE_NPVMB 


Q83140 


mamestra br 


264 


329 


11. 


9 


520 


2 


Q9WEV9 


Q9wev9 


ecotropis o 


265 


328. 5 


11. 


9 


516 


2 


Q8QLJ5 


Q8qlj5 


mamestra co 


266 


328.5 


11. 


9 


516 


2 


Q71AH3 


Q71ah3 


mamestra co 


267 


328.5 


11. 


9 


525 


2 


Q6X859 


Q6x859 


spodoptera 


268 


328 


11. 


8 


560 


2 


Q9YMK2 


Q9ymk2 


lymantria d 


269 


326 


11. 


8 


515 


2 


016988 


016988 


caenorhabdi 


270 


325.5 


11. 


8 


512 


2 


Q80LH4 


Q801h4 


adoxophyes 


271 


325.5 


11. 


8 


529 


2 


Q9W228 


Q9w228 


drosophila 


272 


325 


11. 


7 


463 


2 


Q7T5G5 


Q7t5g5 


cryptophleb 


273 


325 


11. 


7 


515 


2 


Q9E214 


Q9e214 


helicoverpa 


274 


324 . 5 


11. 


7 


446 


2 


Q91KX3 


Q91kx3 


epinotia ap 


275 


324 


11. 


7 


515 


2 


Q8JYS6 


Q8 jys6 


helicoverpa 


276 


323 


11. 


7 


515 


2 


Q99GT6 


Q99gt6 


helicoverpa 


277 


322.5 


11. 


7 . 


513 


2 


011454 


011454 


buzura supp 


278 


322 


11. 


6 


515 


2 


055264 


055264 


heliothis a 


279 


322 


11. 


6 


540 


2 


Q9GYR7 


Q9gyr7 


caenorhabdi 


280 


322 


11. 


6 


636 


2 


Q9WW2 


Q9ww2 


drosophila 


281 


321 


11. 


6 


515 


2 


041856 


041856 


helicoverpa 


282 


320 


11. 


6 


515 


2 


Q91BT6 


Q91bt6 


helicoverpa 


283 


318 


11. 


5 


527 . 


2 


016922 


016922 


caenorhabdi 


284 


317.5 


11. 


5 


541 


2 


045109 


045109 


caenorhabdi 


285 


317 


11. 


5 


593 


2 


017401 


017401 


caenorhabdi 


286 


316.5 


11. 


4 


488 


1 


UDPE_NPVLD 


P41713 


lymantria d 


287 


314.5 


11. 


.4 


507 


1 


YKT 6_CAEEL 


P34317 


caenorhabdi 


288 


313. 5 


11. 


3 


582 


2 


Q7QCS2 


Q7qcs2 


anopheles g 


289 


311 


11. 


2 


524 


1 


UGTC CAEEL 


Q22181 


caenorhabdi 


290 


304.5 


11. 


.0 


294 


2 


Q9H3F9 


Q9h3f9 


homo sapien 


291 


303.5 


11. 


, 0 


537 


2 


Q9NBD8 


Q9nbd8 


drosophila 


292 


300.5 


10. 


,9 


484 


2 


Q91ER4 


Q91er4 


cydia pomon 


293 


300.5 


10. 


,9 


542 


2 


016914 


016914 


caenorhabdi 


294 


296 


10. 


,1 


528 


2 


016915 


016915 


caenorhabdi 


295 


295.5 


10. 


,1 


383 


2 


Q9N5N0 


Q9n5n0 


caenorhabdi 


296 


292 


10. 


.5 


512 


1 


UGTB_CAEEL 


Q22180 


caenorhabdi 


297 


290.5 


10. 


.5 


583 


2 


Q9I7T3 


Q9i7t3 


drosophila 


298 


290 


10. 


,5 


429 


2 


Q9DVR5 


Q9dvr5 


plutella xy 


299 


283.5 


10, 


.2 


527 


2 


017123 


017123 


caenorhabdi 


300 


281 


10, 


.2 


448 


2 


Q999Y7 


Q999y7 


choristoneu 



301 


280 


10. 


1 


290 


2 


Q9R0A4 


302 


280 


10. 


1 


523 


2 


016916 


303 


279 


10. 


1 


493 


1 


UDPE_NPVCD 


304 


278 


10. 


0 


493 


2 


Q993S9 


305 


274. 5 


9. 


9 


580 


2 


016920 


306 


273.5 


9. 


9 


493 


2 


Q9VQT0 


307 


270 


9. 


8 


405 


2 


Q94260 


308 


270 


9. 


8 


434 


2 


Q8JRT0 


309 


267 


9. 


6 


445 


2 


Q910F9 


310 


267 


9. 


6 


445 


2 


Q77IS9 


311 


264.5. 


9. 


6 


501 


2 


016322 


312 


260.5 


9. 


4 


397 


2 


Q7QDP0 


313 


260 


9. 


4 


389 


2 


Q8DSH9 


314 


259. 5 


9. 


4 


489 


1 


UDPE NPVOP 


315 


256 


9. 


2 


310 


2 


Q13460 


316 


256 


9. 


2 


492 


2 


089808 


317 


256 


9 . 


2 


506 


2 


Q9GZD1 


318 


250 


9. 


0 


506 


2 


Q913H6 


319 


248 


9. 


0 


506 


2 


092382 


320 


247 . 5 


8 . 


9 


275 


2 


Q63841 


321 


246.5 


8 . 


9 


506 


1 


UDPE NPVAC 


322 


244 


8 . 


8 


581 


2 


P91326 


323 


242.5 


8 . 


8 


795 


2 


Q19222 


324 


242 


8 . 


7 


491 


1 


UDPE NPVCF 


325 


241 


8 . 


7 


285 


2 


Q13406 


326 


240. 5 


8 . 


7 


287 


2 


Q8WX91 


327 


240 


8 . 


7 


287 


2 


Q8WX90 


328 


240 


8 . 


7 


287 


2 


Q75XT5 


329 


238 . 5 


8 . 


6 


287 


2 


Q75XT3 


330 


238 


8 . 


6 


400 


2 


Q739H3 


331 


237 


8 . 


6 


284 


2 


Q9H3G0 


332 


237 


8 . 


6 


285 


2 


Q96PP1 


333 


236 


8 . 


5 


287 


2 


Q8WX89 


334 


234 


8 . 


5 


284 


2 


Q9H3G1 


335 


234 


8 . 


5 


285 


2 


Q96PN9 


336 


232 . 5 


8 . 


4 


506 


2 


Q8B9N4 


337 


232 


8 . 


4 


402 


2 


Q6HJP7 


338 


229 


8 . 


3 


284 


2 


Q9H3G2 


339 


229 


8 . 


3 


285 


2 


Q96PP0 


340 


228 . 5 


8 . 


3 


402 


2 


Q81RG4 


341 


227 . 5 


8 . 


2 


400 


2 


Q735L3 


342 


227 


8 . 


2 


317 


2 


Q13407 


343 


226 


8 . 


2 


285 


2 


Q64671 


344 


226 


8 . 


2 


402 


2 


Q63C86 


345 


225.5 


8 . 


1 


402 


2 


Q81EB6 


346 


225 


8 . 


1 


37.0... 


2 


Q8MNX7 


347 


223 


8 . 


1 


153 


1 


UD21 RAT 


348 


222.5 


8 . 


0 


283 


2 


Q9BG60 


349 


222.5 


8 . 


0 


370 


2 


Q8IA7 8 


350 


222 


8 . 


0 


287 


2 


Q8WX86 


351 


222 


8 . 


0 


287 


2 


Q8WX87 


352 


222 


8 . 


0 


287 


2 


Q8WX88 


353 


221.5 


8. 


0 


397 


2 


Q736S5 


354 


220 


7. 


9 


485 


2 


Q9Q3Y1 


355 


210.5 


7. 


6 


309 


2 


Q8T744 


356 


206 


7. 


4 


460 


2 


Q6T2C3 


357 


203.5 


7. 


4 


448 


2 


Q9AUV4 



Q9r0a4 mus musculu 
016916. caenorhabdi 
Q90158 choristoneu 
Q993s9 anticarsia 
016920 caenorhabdi 
Q9vqt0 drosophila 
Q94260 caenorhabdi 
Q8jrt0 phthorimaea 
Q910f9 adoxophyes 
Q77is9 adoxophyes 
016322 caenorhabdi 
Q7qdp0 anopheles g 
Q8dsh9 streptococc 
Q65363 orgyia pseu 
Q13460 homo sapien 
089808 epiphyas po 
Q9gzdl caenorhabdi 
Q913h6 bombyx mori 
092382 bombyx mori 
Q63841 mus sp. udp 
P18569 autographa 
P91326 caenorhabdi 
Q19222 caenorhabdi 
Q90157 choristoneu 
Q13406 homo sapien 
Q8wx91 homo sapien 
Q8wx90 homo sapien 
Q75xt5 homo sapien 
Q75xt3 homo sapien 
Q739h3 bacillus ce 
Q9h3g0 homo sapien 
Q96ppl homo sapien 
Q8wx8 9 homo sapien 
Q9h3gl homo sapien 
Q96pn9 homo sapien 
Q8b9n4 rachiplusia 
Q6hjp7 bacillus th 
Q9h3g2 homo sapien 
Q96pp0 homo sapien 
Q81rg4 bacillus an 
Q73513 bacillus ce 
Q13407 homo sapien 
Q64671 rattus norv 
Q63c86 bacillus ce 
Q81eb6 bacillus ce 

Q8mnx7 caenorhabdi 

P19489 rattus norv 
Q9bg60 mustela put 
Q8ia78 caenorhabdi 
Q8wx86 homo sapien 
Q8wx87 homo sapien 
Q8wx88 homo sapien 
Q736s5 bacillus ce 
Q9q3yl amsacta alb 
Q8t744 branchiosto 
Q6t2c3 streptqmyce 
Q9auv4 oryza sativ 



358 


202 


7.3 


397 


2 


Q81CG5 


359 


199 


7.2 


458 


2 


Q6W5Q9 


360 


198 


7.2 


482 


2 


Q6YZQ6 


361 


197.5 


7.1 


488 


2 


Q9LXV0 


362 


192 


6.9 


458 


2 


Q9EWA4 


363 


191.5 


6.9 


448 


2 


Q9SBQ3 


364 


190.5 


6.9 


431 


2 


Q7N615 


365 


190 


6.9 


511 


1 


LGT CITUN 


366 


189.5 


6.8 


446 


2 


Q83WG5 


367 


188.5 


6.8 


506 


2 


Q9L4W6 


368 


187 


6.8 


447 


2 


004114 


369 


187 


6.8 


461 


2 


Q8S3B9 


370 


187 


6.8 


4 62 


2 


Q9FUJ6 


371 


187 


6.8 


480 


2 


Q7XHR3 


372 


186.5 


6.7 


488 


2 


Q8VZF9 


373 


186.5 


6.7 


490 


2 


Q8H0V7 


374 


186 


6.7 


419 


2 


P72650 


375 


185. 5 


6.7 


455 


2 


Q9ZWS2 


376 


185.5 


6.7 


484 


2 


023400 


377 


185 


6.7 


143 


2 


Q98TP0 


378 


185 


6.7 


433 


1 


UFOG SOLME 


379 


184 . 5 


6.7 


453 


2 


Q9SYK9 


380 


184 


6.6 


462 


2 


Q8S998 


381 


182 . 5 


6. 6 


456 


2 


Q9AR43 


382 


182 . 5 


6. 6 


456 


2 


Q9AR4 5 


383 


. 182 


6.6 


271 


2 


Q9N962 


384 


182 


6. 6 


413 


1 


CRTX ERWHE 


385 


182 


6.6 


449 


2 


Q8LKT1 


386 


182 


6.6 


453 


1 


UFOG GENTR 


387 


181.5 


6. 6 


449 


2 


022822 


388 


181.5 


6. 6 


456 


2 


Q9AQV0 


389 


181 . 5 


6.6 


460 


2 


Q6VAA6 


390 


181. 5 


6.6 


479 


2 


023401 


391 


181 


6.5 


449 


2 


Q8LKT3 


392 


181 


6. 5 


555 


2 


Q66PF4 


393 


180.5 


6.5 


259 


2 


016507 


394 


180 . 5 


6.5 


288 


2 


Q8LM15 


395 


180 


6.5 


479 


2 


Q8LB44 


396 


180 


6.5 


482 


2 


Q69JH3 


397 


180 


6.5 


483 


2 


Q93NW9 


398 


179 


6.5 


565 


2 


Q9LSY5 


399 


178 . 5 


6.4 


456 


2 


022304 


400 


178 


6.4 


428 


2 


P95134 


401 


177 


6.4 


405 


2 


031853 


402 


177 


6.4 


457 


2 


Q9SKC1 


403 


177 


6.4 


460 


2 


Q6X1C0_ 


404 


177 


6.4 


481 


2 


Q8GS49 


405 


176. 5 


6.4 


1184 


2 


Q9SYK8 


406 


176 


6.4 


463 


2 


Q50458 


407 


175.5 


6.3 


396 


2 


Q65JC2 


408 


175.5 


6.3 


452 


2 


022303 


409 


175. 5 


6.3 


480 


1 


HQGT_ARATH 


410 


175 


6.3 


478 


2 


Q94IF2 


411 


175 


6.3 


555 


2 


Q84UE9 


412 


174.5 


6.3 


431 


2 


Q8VUJ7 


413 


174.5 


6.3 


456 


2 


Q9AVK6 


414 


174.5 


6.3 


456 


2 


Q9AVK7 



Q81cg5 bacillus ce 
Q6w5q9 streptomyce. 
Q6yzq6 oryza sativ 
Q91xv0 arabidopsis 
Q9ewa4 streptomyce 
Q9sbq3 petunia hyb 
Q7n615 photorhabdu 
Q9mb73 citrus unsh 
Q83wg5 streptomyce 
Q914w6 streptomyce 
004114 perilla fru 
Q8s3b9 glycine max 
Q9fuj6 phaseolus v 
Q7xhr3 oryza sativ 
Q8vzf9 arabidopsis 
Q8h0v7 arabidopsis 
P72650 synechocyst 
Q9zws2 vigna mungo 

023400 arabidopsis 
Q98tp0 platichthys 
Q43641 solanum mel 
Q9syk9 arabidopsis 
Q8s998 phaseolus a 
Q9ar43 vitis vinif 
Q9ar45 vitis vinif 
Q9n962 leishmania 
Q01330 erwinia her 
Q81ktl sorghum bic 
Q96493 gentiana tr 
022822 arabidopsis 
Q9aqv0 vitis vinif 
Q6vaa6 stevia reba 

023401 arabidopsis 
Q81kt3 sorghum bic 
Q66pf4 fragaria an 
016507 caenorhabdi 
Q81ml5 oryza sativ 
Q81b44 arabidopsis 
Q69jh3 oryza sativ 
Q93nw9 streptomyce 
Q91sy5 arabidopsis 
022304 vitis vinif 
P95134 mycobacteri 
031853 bacillus su 
Q9skcl arabidopsis 
Q6xlc0 crocus sati 
Q8gs49 oryza sativ 
Q9syk8 arabidopsis 
Q50458 mycobacteri 
Q65jc2 bacillus li 
022303 vitis vinif 
Q9ml56 arabidopsis 
Q94if2 nicotiana t 
Q84ue9 fragaria an 
Q8vuj7 pantoea agg 
Q9avk6 vitis labru 
Q9avk7 vitis labru 



415 


174 . 5 


6.3 


482 


2 


Q6Z688 


416 


. 174 . 5 


6.3 


490 


2 


Q6AUC5 


417 


174 . 5 


6.3 


496 


2 


Q6Z481 


418 


174 


6.3 


402 


2 


Q8KUH5 


419 


173. 5 


6.3 


431 


1 


CRTX_PANAN 


420 


173. 5 


6.3 


458 


2 


Q7XI34 


421 


172 . 5 


6.2 


390 


2 


Q8S9A8 


422 


172 . 5 


6.2 


462 


2 


Q8S3B5 


423 


172 


6.2 


394 


2 


Q62YM9 


424 


172 


6.2 


468 


2 


Q9SBQ2 


425 


172 


6.2 


476 


2 


Q9LML7 


426 


171 . 5 


6.2 


464 


2 


Q8S3B8 


427 


171 


6.2 


452 


2 


048715 


428 


171 


6. 2 


467 


2 


082383 


429 


171 


6.2 


473 


2 


Q8LKG3 


430 


171 


6.2 


474 


2 


Q6R129 


431 


171 


6.2 


478 


2 


Q942C4 


432 


170. 5 


6.2 


450 


2 


Q89RS0 


433 


170.5 


6.2 


471 


2 


Q942C3 


434 


170 . 5 


6.2 


479 


2 


Q94IF1 


435 


169. 5 


6. 1 


449 


2 


Q9LS16 


436 


169 . 5 


6. 1 


454 


2 


Q9XF16 


437 


169. 5 


6. 1 


456 


2 


Q8LFR6 


438 


169 . 5 


6. 1 


459 


2 


Q9LFK0 


439 


lf>9 . 5 


6. 1 


483 


2 


Q6VAA4 


440 


169 . 5 


6. 1 


487 


2 


Q9LME8 


441 


169 


6. 1 


395 


2 


Q65N79 


442 


169 


6. 1 


456 


2 


022182 


443 


169 


6. 1 


457 


2 


Q9AUV3 


444 


168 . 5 


6. 1 


456 


2 


Q9SKC5 


445 


168 . 5 


6 . 1 


492 


2 


08LM13 


446 


168 


6. 1 


470 


2 


Q8S9A4 


447 


168 


6. 1 


478 


2 


Q66PF3 


448 


167 . 5 


6. 1 


391 


2 


Q8L5N2 


449 


167 . 5 


6. 1 


397 


2 


Q8KNE0 


450 


167 . 5 


6. 1 


475 


2 


023402 


451 


167 . 5 


6. 1 


479 


2 


Q9LSY6 


452 


167 


6. 0 


93 


2 


Q9NS17 


453 


167 


6.0 


474 


2 


Q8S9A6 


454 


166. 5 


6.0 


462 


2 


Q7XWK3 


455 


• 166. 5 


6.0 


466 


2 


Q66PF5 


456 


166. 5 


6. 0 


483 


2 


Q9XES4 


457 


166. 5 


6.0 


484 


2 


Q6ER37 


458 


166 


6.0 


280 


2 


Q9ZWQ4 


459 


166 


6.0 


460 


2 


048676 


460 


166 


6.0 


465 


2 


Q8LKT4 


461 


166 


6. 0 


518 


2 


Q7XWV5 


462 


165. 5 


6.0 


407 


2 


Q9S1V2 


463 


165. 5 


6.0 


451 


2 


Q9SBQ8 


464 


165.5 


6.0 


462 


2 


004622 


465 


165 . 5 


6. 0 


477 


2 


Q9ZWQ5 


466 


165 


6.0 


445 


2 


Q7XHS6 


467 


165 


6.0 


485 


2 


Q6VAA8 


468 


165 


6.0 


493 


2 


Q6EP59 


469 


164.5 


5.9 


435 


2 


Q98EL2 


470 


164.5 


5.9 


449 


2 


022820 


471 


164.5 


5.9 


461 


2 


Q9ZR25 



Q6z688 oryza sativ 
Q6auc5 oryza sativ 
Q6z481 oryza sativ 
Q8kuh5 actinosynne 
P21686 pantoea ana 
Q7xi34 oryza sativ 
Q8s9a8 phaseolus a 
Q8s3b5 phaseolus 1 
Q62ym9 bacillus li 
Q9sbq2 petunia hyb 
Q91ml7 arabidopsis 
Q8s3b8 glycine max 
048715 arabidopsis 
082383 arabidopsis 
Q81kg3 stevia reba 
Q6rl29 citrus sine 
Q942c4 oryza sativ 
Q89rs0 bradyrhizob 
Q942c3 oryza sativ 
Q94ifl nicotiana t 
Q91sl6 arabidopsis 
Q9xfl6 forsythia i 
Q81f r6 arabidopsis 
Q91fk0 arabidopsis 
Q6vaa4 stevia reba 
Q91me8 arabidopsis 
Q65n79 bacillus li 
022182 arabidopsis 
Q9auv3 oryza sativ 
Q9skc5 arabidopsis 
Q81ml3 oryza sativ 
Q8s9a4 phaseolus a 
Q66pf3 fragaria an 
Q815n2 malus domes 
Q8kne0 micromonosp 
023402 arabidopsis 
Q91sy6 arabidopsis 
Q 9ns 17 homo sapien 
Q8s9a6 phaseolus a 
Q7xwk3 oryza sativ 
Q66pf5 fragaria an 
Q9xes4 malus domes 
Q6er37 oryza sativ 
Q9zwq4 vigna mungo 
048676 arabidopsis 
Q81kt4 sorghum bic 
Q7xwv5 oryza sativ 
Q9slv2 streptomyce 
Q9sbq8 petunia hyb 
004622 arabidopsis 
Q9zwq5 vigna mungo 
Q7xhs6 oryza sativ 
Q6vaa8 stevia reba 
Q6ep59 oryza sativ 
Q98el2 rhizobium 1 
022820 arabidopsis 
Q9zr25 verbena x h 



472 


164.5 


5. 


9 


462 


2 


Q8S3B6 


473 


164.5 


5. 


9 


472 


2 


.Q9LFJ9 


474 


164.5 


5. 


9 


492 


2 


Q7XKH9 


475 


164.5 


5. 


9 


500 


2 


Q69JH2 


476 


164 


5. 


9 


458 


2 


Q8RU64 


All 


164 


5. 


9 


471 


1 


IAAG MAIZE 


478 


164 


5. 


9 


478 


2 


Q9S7R8 


479 


164 


5. 


9 


487 


2 


Q94A84 


480 


164 


5. 


9 


487 


2 


Q7XJ52 


481 


164 


5. 


9 


496 


2 


Q7XTG9 


482 


163.5 


5. 


9 


457 


2 


023205 


483 


163^5 


5. 


9 


470 


2 


Q8RU71 


484 


163.5 


5. 


9 


471 


1 


UF01_MAIZE 


485 


163.5 


5. 


9 


471 


2 


Q84XC2 


486 


163 


5. 


9 


361 


2 


Q6X3X7 


487 


163 


5. 


9 


361 


2 


Q6X3X8 


488 


163 


5. 


9 


452 


2 


Q76G23 


489 


163 


5. 


9 


459 


1 


ZOG PHALU 


490 


163 


5. 


9 


486 


2 


Q84ZE0 


491 


163 


5. 


9 


490 


2 


Q7XHS0 


492 


162.5 


5. 


9 


404 


2 * 


Q62XT4 


493 


162 . 5 


5. 


9 


406 


2 


Q65ME1 


494 


162.5 


5. 


9 


420 


2 


022603 


495 


162.5 


5. 


9 


453 


2 


Q9SNB2 


496 


162 . 5 


5. 


9 


457 


2 


Q8LKT6 


497 


162 . 5 


5. 


9 


458 


2 


Q94AB5 


498 


162.5 


5. 


9 


466 


2 


Q947K4 


499 


162.5 


5. 


9 


484 


2 


Q8LJ11 


500 


162 


5. 


9 


464 


2 


Q9M051 


501 


162 


5. 


9 


485 


2 


Q6Z684 


502 


161.5 


5. 


8 


450 


2 


Q9FIA0 


503 


161 . 5 


5 . 


8 


471 


2 


Q8S995 


504 


161.5 


5. 


8 


487 


2 


Q7XSX9 


505 


161 


5. 


8 


454 


1 


ZOX_PHAVU 


506 


161 


5. 


8 


459 


2 


Q9M6E7 


507 


161 


5. 


8 


463 


2 


Q8H3T8 


508 


161 


5. 


8 


474 


2 


082382 


509 


161 


5. 


8 


476 


2 


Q7XTH0 


510 


161 


5. 


8 


1198 


2 


Q06321 


511 


160.5 


5. 


8 


472 


2 


Q8LR91 


512 


160.5 


5. 


8 


495 


2 


Q8S9A0 


513 


160 


5. 


8 


356 


2 


Q6X3X9 


514 


160 


5. 


8 


361 


2 


Q6X3Y0 


515 


160 


5. 


8 


383 


2 


Q9LRE4 


516 


160 


5. 


8 


432 


2 


Q7N614 


517 ... 


160 


5. 


8 


480 


2 


Q6F4D6 


518 


159.5 


5. 


8 


277 


2 


Q7XFQ3 


519 


159.5 


5. 


8 


286 


2 


Q8S657 


520 


159.5 


5. 


8 


286 


2 


Q7XFW0 


521 


159.5 


5. 


8 


392 


2 


Q737I4 


522 


159.5 


5. 


8 


404 


2 


Q6W222 


523 


159.5 


5. 


8 


467 


2 


Q8LC96 


524 


159.5 


5. 


8 


469 


2 


Q69IV0 


525 


159.5 


5. 


8 


478 


2 


023382 


526 


159 


5. 


,7 


285 


2 


Q8T0U7 


527 


159 


5. 


.7 


287 


1 


UF07 MANES 


528 


159 


5. 


.7 


445 


2 


Q69TH5 



Q8s3b6 phaseolus 1 
Q91fj9 arabidopsis 
Q7xkh9 oryza sativ 
Q69jh2 oryza sativ 
Q8ru64 oryza sativ 
Q41819 zea mays (m 
Q9s7r8 arabidopsis 
Q94a84 arabidopsis 
Q7xj52 allium cepa 
Q7xtg9 oryza sativ 
023205 arabidopsis 
Q8ru71 nicotiana t 
P16166 zea mays (m 
Q84xc2 zea mays (m 
Q6x3x7 ipomoea pur 
Q6x3x8 ipomoea nil 
Q76g23 aralia cord 
Q9zsk5 phaseolus 1 
Q84ze0 oryza sativ 
Q7xhs0 oryza sativ 
Q62xt4 bacillus li 
Q65mel bacillus li 
022603 ipomoea pur 
Q9snb2 arabidopsis 
Q81kt6 sorghum bic 
Q94ab5 arabidopsis 
Q947k4 brassica na 
Q81jll oryza sativ 
Q9m051 arabidopsis 
Q6z684 oryza sativ 
Q9fia0 arabidopsis 
Q8s995 phaseolus a 
Q7xsx9 oryza sativ 
P56725 phaseolus v 
Q9m6e7 nicotiana t 
Q8H3t8 oryza sativ 
082382 arabidopsis 
Q7xth0 oryza sativ 
Q06321 saccharomyc 
Q81r91 oryza sativ 
Q8s9a0 phaseolus a 
Q6x3x9 ipomoea tri 
Q6x3y0 ipomoea hed 
Q91re4 ipomoea bat 
Q7n614 photorhabdu 
Q6f4d6 catharanthu 
Q7xfq3 oryza sativ 
Q8s657 oryza sativ 
Q7xfw0 oryza sativ 
Q737i4 bacillus ce 
Q6w222 rhizobium s 
Q81c96 arabidopsis 
Q69iv0 oryza sativ 
023382 arabidopsis 
Q8t0u7 drosophila 
Q40289 manihot esc 
Q69th5 oryza sativ 



529 


159 


5.7 


460 


2 


Q9LFJ8 


530 


159 


5.7 


474 


2 


Q942B6 


531 


158.5 


5.7 


156 


2 


Q96DH9 


532 


158.5 


5.7 


470 


1 


HQGT_RAUSE 


533 


158.5 


5.7 


471 


1 


UF02 MAIZE 


534 


158.5 


5.7 


471 


2 


Q8W1D2 


535 


158.5 


5.7 


473 


2 


Q94IP3 


536 


158.5 


5.7 


490 


2 


Q6Z473 


537 


158 


5.7 


301 


2 


Q705T5 


538 


158 


5.7 


485 


2 


Q7X709 


539 


158 


5.7 


487 


2 


Q7XJ51 


540 


158 


5.7 


491 


2 


Q7XV42 


541 


158 


5.7 


496 


2 


Q9ZQ97 


542 


1.57.5 


5.7 


435 


2 


Q9CD88 


543 


157 . 5 


5.7 


467 


2 


082385 


544 


157 . 5 


5.7 


471 


1 


UF03 MAIZE 


545 


157 . 5 


5.7 


471 


2 


Q8GSG7 


546 


157 . 5 


5.7 


504 


2 


Q851J7 


547 


157 . 5 


5.7 


535 


2 


Q7XKK4 


548 


157 


5.7 ' 


317 


2 


016510 


549 


157 


5.7 


461 


2 


Q9XC67 


550 


157 


5.7 


486 


2 


Q6Z689 


551 


157 


5.7 


496 


2 


Q7XKI0 


552 


156. 5 


5.7 


442 


2 


081010 


553 


156.5 


5.7 


463 


2 


Q8S997 


554 


156.5 


5.7 


490 


2 


Q7XW22 


555 


156.5 


5 . 7 


503 


2 


Q7F9N6 


556 


156 


5.6 


406 


2 


Q97TQ3 


557 


156 


5 . 6 


466 


2 


Q7XT97 


558 


156 


5 . 6 


473 


2 


Q9LSY9 


559 


156 


5 . 6 


479 


2 


Q942C5 


560 


155 . 5 


5. 6 


392 


2 


034539 


561 


155.5 


5.6 


464 


2 


Q8W2B6 


562 


155. 5 


5.6 


496 


2 


Q8RX23 


563 


155.5 


5.6 


496 


2 


Q9LVF0 


564 


155 


5.6 


301 


2 


Q705S7 


565 


155 


5.6 


470 


2 


Q9ZQ54 


566 


154.5 


5.6 


431 


2 


Q47843 


567 


154.5 


5.6 


478 


2 


Q942B8 


568 


154.5 


5.6 


485 


2 


Q8S999 


569 


154 . 5 


5.6 


492 


2 


Q8S7R8 


570 


154 


5.6 


362 


2 


Q6X3X1 


571 


154 


5.6 


421 


2 


Q8KHE4 


572 


154 


5.6 


452 


2 


Q8GVE3 


573 


154 


5.6 


495 


2 


Q9ZQ94 


574 


153.5 


. 5.5 


301 


2 


Q705R8 


575 


153. 5 


5.5 


301 


2 


Q705T0 


576 


153.5 


5.5 


447 


2 


Q9C768 


577 


153.5 


5.5 


469 


2 


Q9LR44 


578 


153.5 


5.5 


478 


2 


Q8LED6 


579 


153.5 


5.5 


478 


2 


Q65XD0 


580 


153.5 


5.5 


487 


1 


UF05_MANES 


581 


153.5 


5.5 


487 


2 


Q6Z485 


582 


153 


5.5 


451 


2 


Q9SNB1 


583 


153 


5.5 


490 


2 


Q6I5X0 


584 


153 


5.5 


496 


2 


Q9ZQ96 


585 


152.5 


5.5 


301 


2 


Q705R7 



Q91fj8 arabidopsis 
Q942b6 oryza sativ 
Q96dh9 homo sapien 
Q9ar73 rauvolfia s 
P16165 zea mays (m 
Q8wld2 zea mays (m 
Q94ip3 solanum sog 
Q6z473 oryza sativ 
Q705t5 arabidopsis 
Q7x709 oryza sativ 
Q7xj51 allium cepa 
Q7xv42 oryza sativ 
Q9zq97 arabidopsis 
Q9cd88 mycobacteri 
082385 arabidopsis 
P16167 zea mays (m 
Q8gsg7 zea mays (m 
Q851j7 oryza sativ 
Q7xkk4 oryza sativ 
016510 caenorhabdi 
Q9xc67 streptomyce 
Q6z689 oryza sativ 
Q7xki0 oryza sativ 
081010 arabidopsis 
Q8s997 phaseolus a 
Q7xw22 oryza sativ 
Q7f9n6 oryza sativ 
Q97tq3 Clostridium 
Q7xt97 oryza sativ 
Q91sy9 arabidopsis 
Q942c5 oryza sativ 
034539 bacillus su 
Q8w2b6 zea mays (m 
Q8rx23 arabidopsis 
Q91vf0 arabidopsis 
Q7 05s7 arabidopsis 
Q9zq54 arabidopsis 
Q47843 pantoea agg 
Q942b8 oryza sativ 
Q8s999 phaseolus a 
Q8s7r8 oryza sativ 
Q6x3xl ipomoea alb 
Q8khe4 nocardia ae 
Q8gve3 citrus maxi 
Q9zq94 arabidopsis 
Q705r8 arabidopsis 
Q705t0 arabidopsis 
Q9c768 arabidopsis 
Q91r44 arabidopsis 
Q81ed6 arabidopsis 
Q65xd0 oryza sativ 
Q40287 manihot esc 
Q6z485 oryza sativ 
Q9snbl arabidopsis 
Q6i5x0 oryza sativ 
Q9zq96 arabidopsis 
Q705r7 arabidopsis 



586 


152.5 


5.5 


329 


2 


Q7XWK6 


587 


152.5 


5.5 


401 


2 


. Q8KNC3 


588 


152. 5 


5.5 


459 


2 


Q8LKT5 


589 


152.5 


5.5 


469 


2 


Q69UF5 


590 


152 . 5 


5.5 


470 


2 


Q9ASY6 


591 


152 . 5 


5.5 


495 


2 


Q8GYL0 


592 


152 . 5 


5.5 


495 


2 


Q9ZQ95 


593 


152 


5.5 


472 


2 


Q65X83 


594 


152 


5.5 


495 


2 


Q7SFJ4 


595 


151. 5 


5.5 


301 


2 


Q705S0 


596 


151. 5. 


5.5 


301 


2 


Q705S9 


597 


151. 5 


5.5 


438 


2 


Q9CD91 


598 


151. 5 


5.5 


450 


2 


Q9FI96 


599 


151. 5 


5.5 


478 


2 


Q9ZVX4 


600 


151. 5 


5 . 5 


497 


2 


Q9FYU7 


601 


151 . 5 


5 . 5 


519 


2 


Q6H7 Jl 


602 


151 


5 . 5 


451 


2 


Q8LFF5 


603 


151 


5 . 5 


451 


2 


Q9FI98 


604 


151 


5 . 5 


453 


2 


Q9S9P6 


605 


151 


5 . 5 


457 


2 


08LKT2 


606 


151 


5 . 5 


470 


2 


Q940V3 


607 


151 


5 . 5 


489 


2 


Q9M9E7 


608 


150 . 5 


5 . 4 


301 


2 


Q705R9 


\j \j j 


150 . 5 


5 . 4 


301 


2 


O705S1 


610 


150. 5 


5.4 


301 


2 


Q705S2 


611 


150 . 5 


5 . 4 


301 


2 


Q705S3 


61 ? 


1 SO S 


5 . 4 


423 


2 


07MEX1 


61 3 


150 


5 . 4 


478 


2 


08W3P8 


614 


149 . 5 


5 . 4 


449 


2 


P95130 


615 


149 . 5 


5 . 4 


44 9 


2 


Q7TXJ4 


616 


149.5 


5 . 4 


461 


2 


06VAA7 

W V jJ**V f 


617 


149.5 


5 . 4 


466 


2 


043526 


618 


149 . 5 


5 . 4 


470 


2 


Q7XWK2 


619 


149.5 


5 . 4 


487 


2 


06F4D5 


6?0 


149.5 


5 . 4 


48 9 


2 


09SK82 


6?1 

\J *L J. 


1 4 9 S 


5 . 4 


1574 


2 


O6RN88 


622 


149 


5 . 4 


392 


2 


Q6HI79 


623 


149 


5 . 4 


507 


2 


Q9SCP6 


624 


149 


5.4 


520 


2 


Q6AT17 


625 


148 . 5 


5.4 


301 


2 


Q705R4 


626 


148 . 5 


5 . 4 


301 


2 


Q705U2 


627 


148 . 5 


5 . 4 


347 


2 


Q8L5C7 


628 


148 . 5 


5 . 4 


412 


2 


Q9S0P6 


629 


148 . 5 


5.4 


417 


2 


Q84WC5 


630 


148 . 5 


5 . 4 


438 


2 


09SNB3 


631 


148 . 5 


5 . 4 .... 


4 64 


2 


Q9FI99 


632 


148 . 5 


5 . 4 


472 


2 


Q65X84 


633 


148.5 


5 . 4 


485 


2 


Q9LSY8 


634 


148 


5.3 


474 


2 


Q6Z478 


635 


148 


5.3 


485 


2 


Q84ZE4 


636 


148 


5.3 


559 


2 


Q8S996 


637 


147 . 5 


5 . 3 


301 


2 


Q705R3 


638 


147.5 


5.3 


301 


2 


Q705U1 


639 


147.5 


5.3 


442 


2 


Q9M0P3 


640 


147.5 


5.3 


455 


2 


Q9ZVY5 


641 


147.5 


5.3 


460 


2 


Q9AUV2 


642 


147.5 


5.3 


466 


2 


Q9LSM0 



Q7xwk6 oryza sativ 
Q8knc3 micromonosp 
Q81kt5 sorghum bic 
Q69uf5 oryza sativ 
Q9asy6 arabidopsis 
Q8gyl0 arabidopsis 
Q9zq95 arabidopsis 
Q65x83 oryza sativ 
Q7sfj4 neurospora 
Q705s0 arabidopsis 
Q705s9 arabidopsis 
Q9cd91 mycobacteri 
Q9fi96 arabidopsis 
Q9zvx4 arabidopsis 
Q9fyu7 brassica na 
Q6h7jl oryza sativ 
Q81ff5 arabidopsis 
Q9f i98 arabidopsis 
Q9s9p6 arabidopsis 
Q81kt2 sorghum bic 
Q940v3 arabidopsis 
Q9m9e7 arabidopsis 
Q705r9 arabidopsis 
Q705sl arabidopsis 
Q705s2 arabidopsis 
Q705s3 arabidopsis 
Q7nexl gloeobacter 
Q8w3p8 phaseolus a 
P95130 mycobacteri 
Q7txj4 mycobacteri 
Q6vaa7 stevia reba 
Q43526 lycopersico 
Q7xwk2 oryza sativ 
Q6f4d5 catharanthu 
Q9sk82 arabidopsis 
Q6bn88 debaryomyce 
Q6hi79 bacillus th 
Q9scp6 arabidopsis 
Q6atl7 oryza sativ 
Q705r4 arabidopsis 
Q705u2 arabidopsis 
Q815c7 pisum sativ 
Q9s0p6 streptomyce 
Q84wc5 arabidopsis 
Q9snb3 arabidopsis 
Q9fi99 arabidopsis 
Q65x84 oryza sativ 
Q91sy8 arabidopsis 
Q6z478 oryza sativ 
Q84ze4 oryza sativ 
Q8s996 phaseolus a 
Q705r3 arabidopsis 
Q705ul arabidopsis 
Q9m0p3 arabidopsis 
Q9zvy5 arabidopsis 
Q9auv2 oryza sativ 
Q91sm0 arabidopsis 



643 


147.5 


5.3 


474 


2 


004930 


004930 


arabidopsis 


644 


147.5 


5.3 


479 


2 


Q9LMF0. 


Q91mf0 


arabidopsis 


645 


147.5 


5.3. 


480 


2 


Q9LSY4 


Q91sy4 


arabidopsis 


646 


147.5 


5.3 


481 


2 


082381 


082381 


arabidopsis 


647 


147 


5.3 


453 


2 


Q8LAI9 


Q81ai9 


arabidopsis 


648 


147 


5.3 


495 


2 


Q6ESW8 


Q6esw8 


oryza sativ 


649 


146.5 


5.3 


301 


2 


Q705T4 


Q705t4 


arabidopsis 


650 


146.5 


5.3 


301 


2 


Q705T9 


Q705t9 


arabidopsis 


651 


146. 5 


5.3 


418 


2 


Q93Z19 


Q93zl9 


arabidopsis 


652 


146.5 


5.3 


424 


2 


Q6VAB1 


Q6vabl 


stevia reba 


653 


146.5 


5.3 


447 


2 


Q8GWA0 


Q8gwa0 


arabidopsis 


654 


146.5 


5.3 


452 


2 


Q9STE3 


Q9ste3 


arabidopsis 


655 


146.5 


5.3 


458 


2 


023406 


023406 


arabidopsis 


656 


146.5 


5.3 


461 


2 


Q6ER38 


Q6er38 


oryza sativ 


657 


146.5 


5.3 


466 


2 


Q8GX09 


Q8gx09 


arabidopsis 


658 


146.5 


5.3 


476 


2 


P93364 


P93364 


nicotiana t 


659 


146.5 


5.3 


476 


2 


Q9AT53 


Q9at53 


nicotiana t 


660 


146.5 


5.3 


496 


2 


Q9ZQ98 


Q9zq98 


arabidopsis 


661 


146.5 


5.3 


504 


2 


Q6K2W8 


Q6k2w8 


oryza sativ 


662 


146 


5.3 


359 


2 


Q8GYB0 


Q8gvb0 


arabidopsis 


663 


146 


5.3 


452 


2 


023380 


023380 


arabidopsis 


664 


146 


5.3 


475 


2 


Q6WFW1 


Q6wfwl 


crocus sati 


665 


146 


5.3 


480 


2 


Q9FTW7 


Q9ftw7 


oryza sativ 


666 


146 


5.3 


481 


2 


Q7XI35 


Q7xi35 


oryza sativ 


667 


146 


5.3 


491 


2 


Q9ZQ99 


Q9zq99 


arabidopsis 


668 


146 


5.3 


772 


2 


Q75I83 


Q75i83 


oryza sativ 


669 


145. 5 


5.3 


301 


2 


Q705T6 


Q705t6 


arabidopsis 


670 


145. 5 


5.3 


481 . 


2 


Q9LNI1 


Q91nil 


arabidopsis 


671 


145 


5.2 


274 


2 


Q8S9A2 


Q8s9a2 


phaseolus a 


672 


145 


5.2 


428 


2 


Q6ZHS1 


Q6zhsl 


oryza sativ 


673 


145 


5.2 


453 


2 


022186 


022186 


arabidopsis 


674 


145 


5.2 


468 


2 


Q69IU8 


Q69iu8 


oryza sativ 


675 


145 


5.2 


479 


2 


Q8H3X8 


Q8h3x8 


oryza sativ 


676 


145 


5.2 


480 


2 


Q6QDB6 


Q6qdb6 


rhodiola sa 


677 


145 


5.2 


494 


2 


Q6K755 


Q6k755 


oryza sativ 


678 


144. 5 


5.2 


446 


2 


Q6D1R8 


Q6dlr8 


erwinia car 


679 


144.5 


5.2 


481 


2 


Q8W237 


Q8w237 


dorotheanth 


680 


144. 5 


5.2 


510 


2 


Q8LJZ7 


Q81 jz7 


sorghum bic 


681 


144.5 


5.2 


571 


2 


Q8GRS8 


Q8grs8 


oryza sativ 


682 


144 


5.2 


392 


2 


Q63AU6 


Q63au6 


bacillus ce 


683 


144 


5.2 


407 


2 


Q97FM0 


Q97fm0 


Clostridium 


684 


144 


5.2 


454 


2 


Q69TJ1 


Q69t jl 


oryza sativ 


685 


144 


5.2 


460 


2 


Q9ZR27 


Q9zr27 


perilla fru 


686 


144 


5.2 


468 


2 


Q6VAA5 


Q6vaa5 


stevia reba 


687 


144 


5.2 


479 


2 


Q6VAB3 


Q6vab3 


stevia reba 


688 


144 


5.2 


48L. 


JZ 


Q8W4G1 


Q8w4gl 


arabidopsis 


689 


144 


5.2 


498 


2 


Q6ESW3 


Q6esw3 


oryza sativ 


690 


143.5 


5.2 


287 


2 


Q8L6L3 


Q81613 


hordeum vul 


691 


143.5 


5.2 


451 


2 


Q8RXA4 


Q8rxa4 


lycopersico 


692 


143.5 


5.2 


462 


2 


Q94BM9 


Q94bm9 


arabidopsis 


693 


143. 5 


5.2 


479 


2 


Q9LML6 


Q91ml6 


arabidopsis 


694 


143.5 


5.2 


485 


2 


Q8S9A7 


Q8s9a7 


phaseolus a 


695 


143.5 


5.2 


493 


2 


Q69XD3 


Q69xd3 


oryza sativ 


696 


143.5 


5.2 


502 


2 


Q67W01 


Q67w01 


oryza sativ 


697 


143.5 


5.2 


524 


2 


Q9LGG2 


Q91gg2 


oryza sativ 


698 


143 


5.2 


453 


2 


Q9FN26 


Q9fn26 


arabidopsis 


699 


143 


5.2 


462 


2 


Q8L9U9 


Q819u9 


arabidopsis 



700 


143 


5.2 


469 


2 


Q9LNI4 


701 


143 


5.2 


480 


. 2 


Q8W4C2 


702 


143 


5.2 


481 


2 


Q9ZWJ3 


703 


142.5 


5.1 


414 


2 


Q9S595 


704 


142.5 


5.1 


452 


2 


Q8GZ65 


705 


142.5 


5.1 


468 


2 


Q681W3 


706 


142.5 


5.1 


476 


2 


Q9AT54 


707 


142.5 


5.1 


477 


2 


Q6Z035 


708 


142.5 


5.1 


477 


2 


Q6ZF65 


709 


142.5 


5.1 


1307 


2 


Q6C8M8 


710 


142 


5.1 


424 


2 


068841 


711 


142 


5.1 


482 


2 


Q8H0F2 


712 


141.5 


5.1 


333 


2 


Q81CW9 


713 


141. 5 


5.1 


449 


2 


Q9SNB0 


714 


141.5 


5.1 


476 


2 


P93365 


715 


141.5 


5.1 


489 


2 


Q9SMG6 


716 


141.5 


5.1 


492 


2 


Q7XSZ0 


717 


141 


5.1 


149 


2 


Q99912 


718 


141 


5.1 


394 


1 


UF06 MANES 


719 


141 


5.1 


462 


2 


Q9LK73 


720 


141 


5.1 


472 


2 


Q75I86 


721 


141 


5.1 


480 


2 


Q65X86 


722 


141 


5.1 


481 


2 


081498 


723 


141 


5.1 


482 


2 


Q65YR5 


724 


141 


5.1 


515 


2 


Q69KM1 


725 


140.5 


5.1 


346 


1 


UF02JVIANES 


726 


140.5 


5.1 


449 


2 


Q9LTH2 


727 


140.5 


5.1 


459 


2 


Q8W2B7 


728 


140.5 


5.1 


463 


2 


Q767C8 


729 


140.5 


5.1 


486 


2 


Q9LEQ4 


730 


140. 5 


5.1 


507 


2 


Q9FTW1 


731 


140 


5.1 


478 


2 


Q8H0F1 


732 


140 


5.1 


480 


2 


Q9FE68 


733 


140 


5.1 


515 


2 


Q69JV3 


734 


139.5 


5.0 


227 


2 


Q9CAY9 


735 


139.5 


5.0 


241 


1 


UF04__MANES 


736 


139.5 


5.0 


379 


2 


Q93HI3 


737 


139.5 


5.0 


397 


2 


Q825U1 


738 


139.5 


5.0 


431 


2 


Q6YY41 


739 


139.5 


5.0 


452 


2 


Q9XIQ4 


740 


139.5 


5.0 


481 


2 


Q9LVR1 


741 


139 


5.0 


98 


2 


Q645Q1 


742 


139 


5.0 


461 


2 


Q67XH7 


743 


139 


5.0 


461 


2 


Q9LHJ2 


744 


139 


5.0 


464 


2 


Q9SGA8 


745 


139 


5.0 


467 


2 


Q9SY84 


746 


139 


5.0 


474 


2 


Q8H462 


747 


139 


5.0 


492 


2 


Q9SBL1 


748 


138.5 


5.0 


438 


2 


Q9M3H8 


749 


138.5 


5.0 


444 


2 


Q9ZU71 


750 


138.5 


5.0 


446 


2 


Q8RU65 


751 


138.5 


5.0 


449 


1 


UF01_MANES 


752 


138.5 


5.0 


450 


2 


Q8LAE5 


753 


138.5 


5.0 


450 


2 


Q9LME9 


754 


.138.5 


5.0 


460 


2 


Q9M052 


755 


138 


5.0 


154 


1 


UFOG_VTTVI 


756 


138 


5.0 


381 


2 


Q9ZWQ3 



Q91ni4 arabidopsis 
Q8w4c2 arabidopsis 
Q9zwj3 arabidopsis 
Q9s595 nocardia ae 
Q8gz65 arabidopsis 
Q681w3 arabidopsis 
Q9at54 nicotiana t 
Q6z035 oryza sativ 
Q6zf65 oryza sativ 
Q6c8m8 yarrowia li 
068841 streptomyce 
Q8h0f2 gentiana tr 
Q81cw9 bacillus ce 
Q9snb0 arabidopsis 
P93365 nicotiana t 
Q9smg6 dorotheanth 
Q7xsz0 oryza sativ 
Q99912 homo sapien 
Q40288 manihot esc 
Q91k73 arabidopsis 
Q75i86 oryza sativ 
Q65x86 oryza sativ 
0814 98 arabidopsis 
Q65yr5 gentiana sc 
Q69kml oryza sativ 
Q40285 manihot esc 
Q91th2 arabidopsis 
Q8w2b7 zea mays (m 
Q767c8 iris hollan 
Q91eq4 arabidopsis 
Q9ftwl oryza sativ 
Q8h0fl torenia hyb 
Q9fe68 arabidopsis 
Q69jv3 oryza sativ 
Q9cay9 arabidopsis 
Q40286 manihot esc 
Q93hi3 streptomyce 
Q825ul streptomyce 
Q6yy41 oryza sativ 
Q9xiq4 arabidopsis 
Q91vrl arabidopsis 
Q645ql fundulus he 
Q67xh7 arabidopsis 
Q91hj2 arabidopsis 
Q9sga8 arabidopsis 
Q9sy84 arabidopsis 
Q8h462 oryza sativ 
Q9sbll sorghum bic 
Q9m3h8 cicer ariet 
Q9zu71 arabidopsis 
Q8ru65 oryza sativ 
Q40284 manihot esc 
Q81ae5 arabidopsis 
Q91me9 arabidopsis 
Q9m052 arabidopsis 
P51094 vitis vinif 
Q9zwq3 vigna mungo 



757 


138 


5. 


0 


385 


2 


Q9STE4 


758 


138 


5. 


0. 


438 


2 


022183 


759 


138 


5. 


0 


455 


2 


064733 


760 


138 


5. 


0 


458 


2 


Q6VAB4 


761 


138 


5. 


0 


476 


2 


Q6AVQ5 


762 


138 


5. 


0 


478 


2 


Q6Z6F0 


763 


138 


5. 


0 


482 


2 


Q65YR6 


764 


138 


5. 


0 


497 


2 


Q7XTG5 


765 


138 


5. 


0 


501 


2 


Q67TS1 


766 


138 


5. 


0 


503 


2 


Q7XE18 


767 


137 . 5 


5. 


0 


431 


2 


Q8GCS3 


768 


137.5 


5. 


0 


443 


2 


Q9ZR26 


769 


137 . 5 


5. 


o 


463 


2 


Q6ZBR2 


770 


137 . 5 


5. 


o 


467 


2 


P93709 


771 


137 . 5 


5. 


o 


501 


2 


Q6K2Q2 


772 


137. 5 


5. 


o 


512 


2 


Q6Z381 


773 


137 


4 . 


9 


476 


2 


Q9SXF2 


774 


137 


4 . 


9 


491 


2 


Q94CZ1 


775 


136. 5 


4 . 


9 


396 


2 


Q70J67 


776 


136. 5 


4 . 


9 


488 


2 


P93789 


111 


136 


4 . 


9 


215 


2 


Q9HCT7 


778 


136 


4 . 


9 


345 


2 


Q84RI3 


779 


136 


4 . 


9 


452 


2 


P95747 


780 


136 


4 . 


9 


465 


2 


Q65XC9 


781 


136 


4 . 


9 


471 


2 


Q942B3 


782 


136 


4 . 


9 


489 


2 


Q6ESV8 


783 


135 . 5 


4 . 


9 


380 


2 


Q8GWE5 


784 


135. 5 


4 . 


9 


453 


2 


Q65X85 


785 


135 . 5 


4 . 


9 


478 


2 


Q8H3V2 


786 


135 


4 , 


9 


267 


2 


Q8RWA6 


787 


135 


4 . 


9 


470 


2 


Q9ZU72 


788 


135 


4 . 


9 


474 


2 


Q6VAB2 


789 


135 


4 . 


9 


490 


2 


Q9SCP5 


790 


135 


4 . 


9 


523 


2 


Q8S9A1 


791 


134 . 5 


4 . 


9 


305 


2 


052939 


792 


134 . 5 


4 . 


9 


387 


2 


Q81Q01 


793 


134 . 5 


4 . 


9 


392 


2 


Q6HY70 


794 


134 . 5 


4 . 


9 


441 


2 


Q76MR7 


795 


134 . 5 


4 . 


9 


447 


2 


Q9FN28 


796 


134 


4 . 


8 


174 


2 


Q9NF29 


797 


134 


4 . 


8 


419 


2 


Q76KZ6 


798 


134 


4 . 


8 


446 


2 


Q8LAB5 


799 


134 


4 . 


8 


455 


2 


Q9T080 


800 


134 


4 . 


8 


456 


2 


023270 


801 


134 


4 . 


8 


466 


2 


Q8RY86 


802 


134 


4 . 


8 


510 


2 


Q943K5 


803 


133. 5 


4 . 


8 


398 


2 


Q97HL4 


804 


133. 5 


4 . 


8 


462 


2 


Q9AUU9 


805 


133.5 


4 . 


8 


479 


2 


Q7XJ50 


806 


133.5 


4 . 


8 


496 


2 


Q94CY6 


807 


133 


4 . 


8 


365 


2 


Q8GZ08 


808 


133 


4 . 


8 


470 


2 


Q8S342 


809 


133 


4 . 


8 


481 


2 


Q6VAB0 


810 


132.5 


4. 


8 


447 


2 


Q9STE6 


811 


132.5 


4. 


8 


447 


2 


Q9XIQ5 


812 


132.5 


4. 


8 


485 


2 


Q69X83 


813 


132.5 


4. 


8 


495 


2 


Q6VAA9 



Q9ste4 arabidopsis 
022183 arabidopsis 
064733 arabidopsis 
Q6vab4 stevia reba 
Q6avq5 oryza sativ 
Q6z6f0 oryza sativ 
Q65yr6 gentiana sc 
Q7xtg5 oryza sativ 
Q67tsl oryza sativ 
Q7xel8 oryza sativ 
Q8gcs3 pantoea ste 
Q9zr26 perilla fru 
Q6zbr2 oryza sativ 
P93709 nicotiana t 
Q6k2q2 oryza sativ 
Q6z381 oryza sativ 
Q9sxf2 Scutellaria 
Q94czl oryza sativ 
Q70j67 streptomyce 
P93789 solarium tub 
Q9hct7 homo sapien 
Q84ri3 beta vulgar 
P 957 47 streptomyce 
Q65xc9 oryza sativ 
Q942b3 oryza sativ 
Q6esv8 oryza sativ 
Q8gwe5 arabidopsis 
Q65x85 oryza sativ 
Q8h3v2 oryza sativ 
Q8rwa6 arabidopsis 
Q9zu72 arabidopsis 
Q6vab2 stevia reba 
Q9scp5 arabidopsis 
Q8s9al phaseolus a 
052939 calothrix v 
Q81q01 bacillus an 
Q6hy70 bacillus an 
Q76mr7 Scutellaria 
Q9fn28 arabidopsis 
Q9nf 29 caenorhabdi 
Q76kz6 streptomyce 
Q81ab5 arabidopsis 
Q9t080 arabidopsis 
023270 arabidopsis 
Q8ry86 arabidopsis 
Q943k5 oryza sativ 
Q97hl4 Clostridium 
Q9auu9 oryza sativ 
Q7xj50 allium cepa 
Q94cy6 oryza sativ 
Q8gz08 arabidopsis 
Q8s342 capsicum an 
Q6vab0 stevia reba 
Q9ste6 arabidopsis 
Q9xiq5 ' arabidopsis 
Q69x83 oryza sativ 
Q6vaa9 stevia reba 



814 


132 


4.8 


497 


2 


Q6ZBE1 


815 


131.5 


4.8 


287. 


2 


Q7BV94 - 


816 


131.5 


4.8 


346 


2 


Q8W1D9 


817 


131. 5 


4.8 


392 


2 


Q8KNF2 


818 


130.5 


4.7 


170 


2 


Q9LMF1 


819 


130.5 


4.7 


389 


2 


Q83X61 


820 


130. 5 


4.7 


453 


2 


Q9LTH3 


821 


130. 5 


4.7 


478 


2 


Q66PF2 


822 


130 


4.7 


455 


1 


UFOG_HORVU 


823 


130 


4.7 


460 


2 


Q9LTA3 


824 


130 


4.7 


490 


2 


Q6L4T2 


825 


129.5 


4.7 


379 


2 


Q67G38 


826 


129.5 


4.7 


400 


2 


Q97HK4 


827 


129. 5 


4.7 


466 


2 


Q6JAH0 


828 


129.5 


4.7 


472 


2 


Q8LEG2 


829 


129. 5 


4.7 


487 


2 


Q69JV0 


830 


129.5 


4 . 7 


1227 


2 


Q751Z4 


831 


129 


4.7 


414 


2 


Q8S9A3 


832 


129 


4.7 


470 


2 


Q8LR92 


833 


128 . 5 


4 . 6 


394 


1 


YC55 MET JA 


834 


128 . 5 


4.6 


396 


2 


Q939Q6 


835 


128 . 5 


4 . 6 


430 


1 


OLED STRAT 


836 


128 . 5 


4 . 6 


453 


2 


Q9LS21 


837 


128.5 


4 . 6 


469 


2 


Q7XJ49 


838 


128 


4 . 6 


309 


2 


Q94BU0 


839 


128 


4.6 


461 


2 


Q7NHR8 


840 


128 


4 . 6 


462 


2 


Q6JAG7 


841 


127.5 


4.6 


470 


2 


Q852C2 


842 


127 . 5 


4 . 6 


473 


2 


Q8LJC6 


843 


127 . 5 


4 . 6 


474 


2 


Q7F0B2 


844 


127 . 5 


4 . 6 


482 


2 


Q8RU72 


845 


127 . 5 


4 . 6 


493 


2 


Q7XKG0 


846 


127 . 5 


4 . 6 


1209 


2 


Q6CUV2 


847 


127 


4 . 6 


463 


2 


Q8RXA5 


848 


127 


4 . 6 


470 


2 


Q9AUV1 


849 


127 


4 '. 6 


488 


2 


Q8VZE9 


850 


127 


4 . 6 


528 


2 


Q8LNA9 


851 


126. 5 


4.6 


356 


2 


Q7S0K9 


852 


126. 5 


4.6 


486 


2 


Q6AUW7 


853 


126 


4 . 6 


440 


2 


064732 


854 


126 


4.6 


444 


2 


Q8S9A5 


855 


126 


4 . 6 


484 


2 


Q6AUW6 


856 


126 


4.6 


508 


2 


Q6H8F6 


857 


125.5 


4 . 5 


175 


2 


Q6H8F9 


858 


125.5 


4.5 


381 


2 


005496 


859 


125.5 


4.5 


461 


2 


Q93ZG5 


860 


125 . 5 


4.5 


487 


2 


Q7XSY7 


861 


125 


4.5 


463 


2 


Q7XU02 


862 


124.5 


4.5 


504 


2 


Q95GQ5 ' 


863 


124 


4.5 


358 


2 


Q7XKM2 


864 


123.5 


4.5 


464 


2 


Q7XQJ5 


865 


123.5 


4.5 


475 


2 


Q7XMA8 


866 


123.5 


4.5 


482 


2 


Q6Z4C0 


867 


123.5 


4.5 


490 


2 


Q6JAG5 


868 


123 


4.4 


464 


2 


Q7XQJ9 


869 


123 


4.4 


465 


2 


Q8S465 


870 


123 


4.4 


467 


2 


Q93XP7 



Q6zbel oryza sativ 
Q7bv94 bacillus su 
Q8wld9 jatropha cu 
Q8knf2 micromonosp 
Q91mfl arabidopsis 
Q83x61 streptomyce 
Q91th3 arabidopsis 
Q66pf2 fragaria an 
P14726 hordeum vul 
Q91ta3 arabidopsis 
Q614t2 oryza sativ 
Q67g38 streptomyce 
Q97hk4 Clostridium 
Q6jah0 sorghum bic 
Q81eg2 arabidopsis 
Q69jv0 oryza sativ 
Q751z4 ashbya goss 
Q8s9a3 phaseolus a 
Q81r92 oryza sativ 
Q58652 methanococc 
Q939q6 streptomyce 
Q53685 streptomyce 
Q91s21 arabidopsis 
Q7xj49 allium cepa 
Q94bu0 arabidopsis 
Q7nhr8 gloeobacter 
Q6jag7 sorghum bic 
Q852c2 oryza sativ 
Q81jc6 oryza sativ 
Q7f0b2 oryza sativ 
Q8ru72 nicotiana t 
Q7xkg0 oryza sativ 
Q6cuv2 kluyveromyc 
Q8rxa5 zea mays (m 
Q9auvl oryza sativ 
Q8vze9 arabidopsis 
Q81na9 oryza sativ 
Q7s0k9 neurospora 
Q6auw7 oryza sativ 
064732 arabidopsis 
Q8s9a5 phaseolus a 
Q6auw6 oryza sativ 
Q6h8f6 oryza sativ 
Q6h8f9 oryza sativ 
005496 bacillus su 
Q93zg5 arabidopsis 
Q7xsy7 oryza sativ 
Q7xu02 oryza sativ 
Q95gq5 nepenthes s 
Q7xkm2 oryza sativ 
Q7xqj5 oryza sativ 
Q7xma8 oryza sativ 
Q6z4c0 oryza sativ 
Q6jag5 sorghum bic 
Q7xqj9 oryza sativ 
Q8s465 zea mays (m 
Q93xp7 zea mays (m 



871 


123 


4.4 


469 


2 


Q7XMQ0 


872 


123 


4 . 4 


... 637 


2 


Q8S1P6 


873 


122.5 


4.4 


455 


2 


Q9FI97 


874 


122.5 


4.4 


487 


2 


Q75HA1 


875 


122.5 


4.4 


493 


2 


Q84M46 


876 


122. 5 


4 . 4 


1211 


2 


Q9Y751 


877 


122 . 5 


4.4 


2259 


1 


YCF2 PHYPA 


878 


122 


4.4 


135 


2 


Q50456 


879 


122 


4 . 4 


388 


2 


Q9RPA1 


880 


122 


4.4 


464 


2 


Q6JAG9 


881 


122 


4 .4 


501 


2 


Q9FU68 


882 


122 


4 . 4 


1310 


2 


Q7RZT3 


883 


121. 5 


4 . 4 


485 


2 


Q6Z4B7 


884 


121 


4 . 4 


482 


2 


Q7XZD0 


885 


120. 5 


4 . 4 


422 


2 


Q9F826 


886 


120. 5 


4 . 4 


490 


2 


Q9SJL0 


887 


120. 5 


4 . 4 


491 


2 


Q67TS4 


888 


120 . 5 


4 . 4 


497 


2 


Q69X81 


889 


120 . 5 


4 . 4 


498 


2 


Q9FU69 


890 


120 


4 . 3 


418 


2 


Q9RYI3 


891 


119 . 5 


4 . 3 


200 


2 


Q6AT14 


892 


119 . 5 


4 . 3 


270 


2 


Q6AUW5 


893 


119 . 5 


4 . 3 


422 


2 


070023 


894 


119 . 5 


4 . 3 


448 


2 


Q9LJA6 


895 


119 . 5 


4 . 3 


504 


2 


Q94NR5 


896 


119.5 


4 . 3 


504 


2 


Q95GS1 


897 


119. 5 


4 . 3 


504 


2 


Q95GT1 


898 


119. 5 


4 . 3 


504 


2 


Q7IGS0 


899 


119. 5 


4 . 3 


721 


2 


Q6BHK6 


900 


119 


4 . 3 


194 


2 


Q8GSR1 


901 


119 


4 -3 


194 


2 


Q8GSR3 


902 


119 


4 . 3 


196 


2 


Q8GSR2 


903 


119 


4 . 3 


483 


2 


Q75HJ2 


904 


118 . 5 


4 . 3 


418 


1 


MGT STRLI 


905 


118 . 5 


4 . 3 


418 


2 


Q9ADH3 


906 


118 . 5 


4 . 3 


453 


2 


Q9T081 


907 


118.5 


4 . 3 


463 


2 


Q7XU03 


908 


118 . 5 


4 . 3 


497 


2 


Q9FU67 


909 


118 . 5 


4.3. 


504 


2 


Q95GU2 


910 


118 . 5 


4 . 3 


679 


2 


Q8 J1H4 


911 


118 


4 . 3 


464 


2 


Q6JAG8 


912 


118 


4.3 


468 


2 


Q9LVW3 


913 


118 


4 . 3 


507 


2 


Q6Z4L0 


914 


118 


4 . 3 


698 


2 


Q95AS6 


915 


117 . 5 


4.2 


436 


2 


Q76GS1 


916 


117 . 


4.2 


256 


2 


Q7PJA1 


917 


117 


4 . 2 


460 


2 


Q7WTE9 


918 


117 


4.2 


481 


2 


Q8W491 


919 


117 


4.2 


497 


2 


Q67TS2 


920 


116. 5 


4.2 


419 


2 


Q6U868 


921 


116. 5 


4.2 


419 


2 


Q6U870 


922 


116.5 


4.2 


428 


2 


069000 


923 


116.5 


4.2 


472 


2 


Q9FSS3 


924 


116.5' 


4.2 


504 


2 


Q95GQ1 


925 


116.5 


4.2 


504 


2 


Q95GU1 


926 


116 


4.2 


169 


2 


Q8L7D4 


927 


116 


4.2 


488 


2 


Q6ZBR9 



Q7xmq0 oryza sativ 
Q8slp6 oryza sativ 
Q9f i97 arabidopsis 
Q75hal oryza sativ 
Q84m46 oryza sativ 
Q9y751 pichia past 
P61243 physcomitre 
Q50456 mycobacteri 
Q 9 rpa 1 s t r ep t omy ce 
Q6jag9 sorghum bic 
Q9fu68 oryza sativ 
Q7rzt3 neurospora 
Q6z4b7 oryza sativ 
Q7xzd0 glycyrrhiza 
Q9f826 micromonosp 
Q9sjl0 arabidopsis 
Q67ts4 oryza sativ 
Q69x81 oryza sativ 
Q9fu69 oryza sativ 
Q9ryi3 deinococcus 
Q6atl4 oryza sativ 
Q6auw5 oryza sativ 
070023 streptomyce 
Q91ja6 arabidopsis 
Q94nr5 nepenthes b 
Q95gsl nepenthes c 
Q95gtl nepenthes v 
Q7igs0 nepenthes m 
Q6bhk6 debaryomyce 
Q8gsrl triticum ae 
Q8gsr3 triticum ae 
Q8gsr2 triticum ae 
Q75hj2 oryza sativ 
Q5 4 3 8 7 s t r ep tomyce 
Q9adh3 streptomyce 
Q9t081 arabidopsis 
Q7xu03 oryza sativ 
Q9fu67 oryza sativ 
Q95gu2 nepenthes i 
Q8jlh4 ustilago ma 
Q6jag8 sorghum bic 
Q91vw3 arabidopsis 
Q6z410 oryza sativ 
Q95as6 pseudophoen 
Q76gsl nepenthes v 
Q7pjal anopheles g 
Q7wte9 streptomyce 
Q8w491 arabidopsis 
Q67ts2 oryza sativ 
Q6u8 68 mycobacteri 
Q6u870 mycobacteri 
O69000 mycobacteri 
Q9fss3 oryza sativ 
Q95gql nepenthes g 
Q95gul nepenthes i 
Q817d4 arabidopsis 
Q6zbr9 oryza sativ 
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Q7IGS1 
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Q9ZQG3 
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Q7Y232 
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Q95GT5 
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Q6PVW5 
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Q8GHC2 
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Q6U848 
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Q95GP7 
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Q95GQ2 
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Q95GS5 
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Q93ZJ2 
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Q7MAS9 


974 


112.5 


4.1 


637 


2 


023649 
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Q9M8Z7 
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698 
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Q9TL88 
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098710 


978 
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Q9TJQ2 
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Q9TJY8 
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431 
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Q9MS86 
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684 
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Q7YM49 
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Q7YM51 
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Q95AT8 
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Q95D22 



Q9zgb4 streptomyce 
Q9syc4 arabidopsis 
Q7xkgl oryza sativ 
Q95gs9 nepenthes m 
Q95gt0 nepenthes t 
Q95gu3 nepenthes r 
Q7sli0 neurospora 
Q9zgb8 streptomyce 
Q94c57 arabidopsis 
Q9xigl arabidopsis 
Q9y752 Candida alb 
Q6m068 methanococc. 
Q9zgbl streptomyce 
Q7xvpl oryza sativ 
Q94nv9 nepenthes r 
Q95gp2 nepenthes s 
Q95grl nepenthes d 
Q95gr4 nepenthes m 
Q95gr5 nepenthes p 
Q95gt3 nepenthes t 
Q95gt4 nepenthes h 
Q7igsl nepenthes a 
Q83x73 streptomyce 
Q95as0 orania tris 
Q9f8u7 streptomyce 
087831 streptomyce 
Q9zqg3 arabidopsis 
Q43716 petunia hyb 
Q7y232 arabidopsis 
Q95gt5 nepenthes a 
Q67g44 streptomyce 
033939 saccharopol 
08 6304 streptomyce 
Q6pvw5 fragaria an 
049492 arabidopsis 
Q95d4 3 magnolia do 
Q95d67 michelia hy 
Q18872 caenorhabdi 
Q8pwf3 methanosarc 
Q8ghc2 streptomyce 
Q6u848 mycobacteri 
Q95gp7 nepenthes t 
Q95gq2 nepenthes d 
Q95gs5 nepenthes t 
Q93zj2 arabidopsis 
Q7mas9 wolinella...s 
023649 arabidopsis 
Q9m8z7 arabidopsis 
Q9tl88 codonopsis 
098710 pinckneya p 
Q9tjq2 rustia sple 
Q9tjy8 condaminea 
Q9ms86 magnolia st 
Q7ym49 liriodendro 
Q7ym51 eupomatia b 
Q95at8 caryota mit 
Q95d22 magnolia sc 
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Q95D87 


986 


112 
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Q95DA1 


987 
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Q7VFR3 


988 


111.5 
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374 
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Q8X5L5 


989 
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465 
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Q9LZD8 


990 


111.5 
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504 
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Q94PH2 


991 


111.5 
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504 
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Q94Q55 


992 


111.5 
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504 
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Q95GQ8 


993 


111.5 
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504 
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Q95GR6 


994 


111.5 
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504 
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Q95GV2 


995 


111.5 
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504 
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Q95GV5 


996 


111. 5 
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504 
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Q7IGR8 


997 


111.5 


4.0 


504 
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Q7IGS2 


998 


111 


4.0 


403 
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Q8E6C2 


999 


111 


4.0 


432 


2 


Q9RN63 


1000 


111 


4.0 


443 


2 


P95834 


1001 


111 


4.0 


504 
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Q95GP6 


1002 


111 


4.0 


684 
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Q7YM48 


1003 


111 


4.0 


695 
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Q95AS9 


1004 


111 


4.0 


698 
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Q95AT9 


1005 


111 


4.0 


731 


2 


Q95D45 


1006 


111 


4.0 


731 
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Q95DB7 


1007 


111 


4.0 


731 
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Q95DB8 


1008 
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4 . 0 


731 
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Q95DC0 


1009 


111 


4.0 


732 
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Q95D23 


1010 


111 
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732 
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Q95D24 


1011 


111 


4.0 


732 
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Q95D26 


1012 


111 


4 . 0 


732 
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Q95D27 


1013 


111 


4.0 


732 
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Q95D28 


1014 


111 


4.0 


732 
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Q95D30 


1015 


111 


4.0 


732 


2 


Q95D33 


1016 


111 


4.0 


732 


2 


Q95D35 


1017 


111 


4.0 


732 
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Q95D59 


1018 


111 


4 . 0 


732 
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Q95D60 


1019 


111 


4.0 


732 
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Q95D61 


1020 


111 


4.0 


732 
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Q95D62 


1021 


111 


4.0 


732 
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Q95D63 


1022 


111 


4.0 


732 
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Q95D64 


1023 


111 


4.0 


732 
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Q95D65 


1024 


111 


4.0 


732 
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Q95D66 


1025 


111 
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732 
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Q95D68 


1026 


111 
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732 
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Q95D69 


1027 


111 


4.0 


732 


2 


Q95D70 


1028 


111 
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732 
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Q95D71 


1029 


111 
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732 
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Q95D72 


1030 


111 
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732 
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... Q95D73 


1031 


111 
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732 


2 


Q95D74 


1032 


111 
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732 


2 


Q95D75 


1033 


111 
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732 
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Q95D76 


1034 


111 
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732 
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Q95D77 


1035 


111 
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732 
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Q95D78 


1036 


111 
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732 
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Q95D79 


1037 


111 
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732 
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Q95D80 


1038 


111 
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732 
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Q95D85 


1039 


111 
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Q95D86 


1040 


111 


4.0 


732 
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Q95D88 


1041 


111 


4.0 


732 
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Q95D89 



Q95d87 magnolia ko 
Q95dal magnolia gr 
Q7vfr3 helicobacte 
Q8x515 escherichia 
Q91zd8 arabidopsis 
Q94ph2 nepenthes r 
Q94q55 nepenthes g 
Q95gq8 nepenthes m 
Q95gr6 nepenthes r 
Q95gv2 nepenthes d 
Q95gv5 nepenthes a 
Q7igr8 nepenthes m 
Q7igs2 nepenthes s 
Q8e6c2 streptococc 
Q9rn63 streptomyce 
P95834 streptomyce 
Q95gp6 nepenthes m 
Q7ym48 magnolia tr 
Q95as9 chamaedorea 
Q95at9 borassus fl 
Q95d45 liriodendro 
Q95db7 magnolia ma 
Q95db8 magnolia ma 
Q95dc0 magnolia de 
Q95d23 magnolia gu 
Q95d24 magnolia gr 
Q95d26 magnolia me 
Q95d27 magnolia ma 
Q95d28 magnolia le 
Q95d30 magnolia sp 
Q95d33 magnolia fr 
Q95d35 magnolia po 
Q95d59 elmerrillia 
Q95d60 michelia fl 
Q95d61 michelia ba 
Q95d62 michelia ba 
Q95d63 michelia od 
Q95d64 michelia fi 
Q95d65 michelia ma 
Q95d66 michelia ch 
Q95d68 michelia wi 
Q95d69 michelia ve 
Q95d70 michelia sh 
Q95d71 michelia mo 
Q95d72 michelia ma 
Q95d73 michelia ma 
Q95d74 michelia ma 
Q95d75 michelia la 
Q95d76 michelia in 
Q95d77 michelia fo 
Q95d78 michelia do 
Q95d79 michelia ch 
Q95d80 michelia ca 
Q95d85 magnolia st 
Q95d86 magnolia sa 
Q95d88 magnolia bi 
Q95d89 magnolia ze 



1042 


111 


4.0 


732 
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Q95D90 


Q95d90 magnolia sp 


1043 
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4.0 


732. 
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Q95D91 


Q95d91 magnolia sa 


1044 
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Q95D93 


Q95d93 magnolia da 


1045 


111 
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Q95D94 


Q95d94 magnolia ca 


1046 


111 
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732 
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Q95D96 


Q95d96 magnolia ca 


1047 


111 
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732 
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Q95D97 


Q95d97 magnolia gi 


1048 


111 
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732 
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Q95DA3 


Q95da3 magnolia pe 


1049 


111 
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732 
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Q95DA8 


Q95da8 magnolia si 


1050 


111 
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732 
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Q95DA9 


Q95da9 magnolia si 


1051 


111 
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732 
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Q95DB0 


Q95db0 magnolia gl 


1052 


111 
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732 
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Q95DB1 


Q95dbl kmeria sept 


1053 


111 
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732 
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Q95DB2 


Q95db2 kmeria dupe 


1054 


111 


4.0 


732 
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Q95DB3 


Q95db3 magnolia tr 


1055 


111 
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Q95DB4 


Q95db4 magnolia ro 


1056 


111 
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732 
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Q95DB5 


Q95db5 magnolia of 


1057 
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732 
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Q95DB6 


Q95db6 magnolia he 
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Q95DB9 


Q95db9 magnolia fr 


1059 


111 
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732 
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Q95DC1 


Q95dcl magnolia pt 


1060 


111 
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732 
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Q95DC2 


Q95dc2 magnolia li 


1061 


111 
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732 
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Q95DC3 


Q95dc3 magnolia he 


1062 


111 


4.0 


732 
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Q95DC4 


Q95dc4 magnolia de 


1063 


111 


4.0 


732 
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Q95DC5 


Q95dc5 magnolia co 


1064 


111 


4.0 


732 
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Q95DC6 


Q95dc6 magnolia ch 


1065 


111 


4.0 


842 
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CUL8_YEAST 


P47050 saccharomyc 


1066 


110.5 


4.0 


372 
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Q83ZA2 


Q83za2 escherichia 


1067 


110.5 


4.0 


406 


2 


Q93MW2 


Q93mw2 nocardia br 


1068 


110.5 


4.0 


445 
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P74079 


P7 4079 synechocyst 


1069 


110.5 


4.0 


503 
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Q95GR2 


Q95gr2 nepenthes h 


1070 


110.5 


4.0 


504 
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Q95GR8 


Q95gr8 nepenthes 1 


1071 


110.5 


4.0 


504 
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Q95GU9 


Q95gu9 nepenthes o 


1072 


110.5 
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504 
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Q95GV6 


Q95gv6 nepenthes b 
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110.5 
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Q6EW03 


Q6ew03 nymphaea al 
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Q7NXT2 


Q7nxt2 chromobacte 
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110 
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379 
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Q9L9F5 


Q919f 5 streptomyce 
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110 


4.0 


416 
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Q6T1C7 


Q6tlc7 streptomyce 


1077 


110 


4.0 


448 
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Q8RWP1 


Q8rwpl arabidopsis 
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110 
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Q7YM50 


. Q7ym50 galbulimima 
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Q95D25 


Q95d25 magnolia ka 
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Q95D46 


Q95d46 pachylarnax 
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110 
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Q95d81 magnolia li 
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Q95D84 


Q95d84 magnolia cy 
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Q95d92 magnolia de 
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Q95DA4 


Q95da4 manglietia 
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Q95da5 magnolia ni 
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Q95da6 magnolia ni 
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372 
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Q83ZB1 . 


Q83zbl escherichia 
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403 
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Q8E0Q4 


Q8e0q4 streptococc 
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109.5 
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428 
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Q935Z8 


Q935z8 streptomyce 
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109.5 
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439 
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Q7NHW5 


Q7nhw5 gloeobacter 
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443 
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Q9L4U6 


Q914u6 streptomyce 
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109.5 
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504. 
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Q95GT9 


Q95gt9 nepenthes m 
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Q95gv4 nepenthes e 
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Q97ES3 


Q97es3 Clostridium 
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Q9AM85 


Q9am85 riemerella 
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Q69SI6 


Q69si6 oryza sativ 
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109 
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408 
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P96564 


P96564 amycolatops 
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109 
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426 
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Q8RS24 


Q8rs24 streptomyce 
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Q95AS2 


1100 
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Q95D95 


1101 
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1102 
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Q7RJC0 
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1104 
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Q95GP3 
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Q95GQ0 
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Q95GQ7 
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Q95GQ9 
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Q95GS0 


1111 
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Q95GS3 


1112 
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Q95GV0 


1113 


108.5 


3.9 
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Q95GV1 


1114 
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Q95GV7 


1115 


108.5 
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Q716U1 
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Q9GHQ1 
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Y908 MET JA 
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Q8LEW0 
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Q84MN5 
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Q6Z4K7 
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Q67GP6 
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Q9BAQ6 
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Q95D32 
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Q95D34 
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Q95D98 
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Q91LB1 
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Q9RPK3 
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Q95GP5 
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Q95GQ4 
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504 
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Q95GR0 
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504 
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Q95GV3 
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Q7YM60 


1141 ' 
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3.9 


690 
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Q8M9S2 


1142 


107. 5 


3.9 


724 
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Q9TJZ6 


1143 


107.5 
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732 


2 


Q9TJP3 
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107.5 
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732 
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Q9TJS2 


1145 


107 


3.9 


369 
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MURG_CLOTE 


1146 


107 


3.9 


379 
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Y4 52_METAC 


1147 


107 


3.9 


427 
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MURE_CAMJE 


1148 


107 
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501 
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Q7YKK6 


1149 


107 


3.9 


691 
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Q95AS8 


1150 


107 


3.9 


694 
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Q95AQ6 


1151 


107 


3.9 


695 


2 


Q95AR8 


1152 


107 


3.9 


698 


2 


Q95AR9, 


1153 


107 


3.9 


698 


2 


Q95AS7 


1154 


107 


3.9 


698 


2 


Q95AU4 


1155 


107 


3.9 


732 
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Q95D29 



Q95as2 leopoldinia 
Q95d95 magnolia am. 
Q95da7 magnolia wi 
Q7rjc0 Plasmodium 
Q95gr3 nepenthes m 
Q95gp3 nepenthes f 
Q95gp9 nepenthes e 
Q95gq0 nepenthes s 
Q95gq6 nepenthes s 
Q95gq7 nepenthes b 
Q95gq9 nepenthes p 
Q95gs0 nepenthes b 
Q95gs3 nepenthes a 
Q95gv0 nepenthes 1 
Q95gvl nepenthes i 
Q95gv7 nepenthes g 
Q716ul elegia fene 
Q9ghql pseudonemac 
Q58318 methanococc 
Q81ew0 arabidopsis 
Q84mn5 oryza sativ 
Q6z4k7 oryza sativ 
Q67gp6 japonolirio 
Q6ylu5 japonolirio 
Q9baq6 styloceras 
Q95d32 magnolia po 
Q95d34 magnolia pa 
Q95d98 magnolia ta 
Q95d99 magnolia sh 
Q95da0 magnolia gu 
Q95da2 magnolia vi 
013592 saccharomyc 
Q8vav7 white spot 
Q8qtd0 white spot 
Q911bl white spot 
P19158 saccharomyc 
Q9rpk3 streptococc 
Q95gp5 nepenthes m 
Q95gq4 nepenthes t 
Q95gr0 nepenthes f 
Q95gv3 nepenthes s 
Q7ym60 persea amer 
Q8m9s2 erica carne 
Q9tjz6 alseis lugo 
Q9tjp3 simira viri 
Q9tjs2 pentagonia 
Q893r7 Clostridium 
Q8ttil methanosarc 
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Q7ykk6 utricularia 
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Q95d31 magnolia il 
Q95d47 manglietia 
Q95d48 manglietia 
Q95d49 manglietia 
Q95d50 manglietia 
Q95d51 manglietia 
Q95d52 manglietia 
Q95d53 manglietia 
Q95d54 manglietia 
Q95d55 manglietia 
Q95d56 manglietia 
Q95d57 manglietia 
Q95d58 manglietia 
Q8knd7 micromonosp 
Q9rp99 streptomyce 
Q6u850 mycobacteri 
033935 saccharopol 
054224 saccharopol 
Q83wel micromonosp 
Q95gq3 nepenthes v 
Q95gr9 nepenthes t 
Q95gt8 nepenthes p 
Q8m8v3 torricellia 
Q9tjt9 emmenoptery 
Q74n29 nanoarchaeu 
033282 mycobacteri 
Q7ty01 mycobacteri 
Q7wq52 bordetella 
Q7d6n9 mycobacteri 
087830 streptomyce 
Q8vwb7 streptomyce 
Q9zuv0 arabidopsis 
Q7ykj2 utricularia 
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Q95ap9 orbignya ba 
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Q95aq5 allagoptera 
Q95asl manicaria s 
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Q95aq3 cocos nucif 
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Q95ar3 aiphanes ac 
Q95ar6 prestoea ac 
Q95ar7 oenocarpus 
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Q712q2 escherichia 
Q9zqg4 arabidopsis 
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Q95gu6 nepenthes c 
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Q03899 saccharomyc 
Q9c9b0 arabidopsis 
Q95ay9 osmorhiza b 
Q95ag6 osmorhiza 1 
Q7m9d4 wolinella s 
Q8wjz9 baloskion t 
Q659x8 escherichia 
098707 mussaenda e 
Q9tjs0 pogonopus s 
Q9tjv8 chimarrhis 
Q9tl72 pittosporum 
Q9tl70 hedera heli 
Q9vael drosophila 
Q8gsq9 triticum ae 
P96565 amycolatops 
Q9gfh3 liriodendrp 
Q9ms90 austrobaile 
Q8wi66 cyrtostachy 
Q8wi64 howea belmo 
Q6ylv6 ophiopogon 
Q6ylt4 serenoa sp. 
Q8wi77 archontopho 
Q6ylt5 dasypogon b 
Q8wi76 clinostigma 
Q8wi67 burretioken 
Q6ylu9 aponogeton 
Q6ylv5 polygonatum 
Q6ylv3 cyanella hy 
Q6ylv4 polygonatum 
Q67gp2 dasypogon h 
Q95ar0 desmoncus o 
Q95at7 aphandra na 
Q95as4 iriartea de 
Q95aul livistona s 
Q8wi62 oncosperma 
Q8wi63 hydriastele 
Q8wi65 gronophyllu 
Q8wi68 bentinckia 
Q8wi75 linospadix 
Q95ar4 acrocomia a 
Q95as5 ravenea hil 
Q95at3 chambeyroni 
Q95au2 washingtoni 
Q6a215 campylanthu 
Q95at6 phytelephas 
Q95d82 magnolia ac 
Q95d83 magnolia ac 
Q6u852 mycobacteri 
Q6u862 mycobacteri 
Q98el9 rhizobium 1 
Q8gea2 mycobacteri 
Q81gd9 arabidopsis 
Q6vaa3 stevia reba 
Q95gs4 nepenthes t 
Q95gu0 nepenthes m 
Q95gu8 nepenthes 1 
Q71n60 stimpsonia 
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Q85xz9 trevesia lo 
Q85y02 trevesia pa 
Q85y07 trevesia ba 
Q85y00 trevesia pa 
Q85y01 trevesia pa 
Q85y03 trevesia su 
Q9t jpl warszewiczi 
Q7yu76 drosophila 
Q9tl66 helwingia j 
Q9f832 micromonosp 
Q95arl bactris hum 
Q95ar2 astrocaryum 
Q6q962 uncultured 
Q95d44 liriodendro 
Q9tl43 liriodendro 
Q6bwh4 debaryomyce 
Q6x9rl dictyoglomu 
Q86t84 homo sapien 
Q79stl salmonella 
P96056 salmonella 
Q8zmn4 salmonella 
Q9rmp3 mycobacteri 
Q91ps8 arabidopsis 
Q81p23 nierembergi 
P03029 klebsiella 
Q95gp8 nepenthes m 
Q95gu5 nepenthes c 
Q8slr5 clethra bar 
Q85xz5 oplopanax h 
Q9t j s9 mussaenda a 
Q85xz8 trevesia bu 
Q9sc96 melanophyll 
Q8f a89 escherichia 
Q6cny4 kluyveromyc 
Q66b02 yersinia ps 
Q8zfl0 yersinia pe 
Q91555 streptomyce 
P13991 escherichia 
Q9fwcl oryza sativ 
Q9xyr4 schistosoma 
Q8hup0 convallaria 
Q95atl calyptronom 
Q95au6 calamus cae 
Q95aq9 gastrococos 
Q95as3 wettinia hi 
Q95cw5 magnolia el 
P15144 homo sapien 
Q6qrl8 streptomyce 
Q91ne6 arabidopsis 
Q6bkr3 debaryomyce 
Q95gp4 nepenthes 1 
Q95gt7 nepenthes d 
P35558 homo sapien 
Q95ay4 anthriscus 
Q97yg8 sulfolobus 
Q8wk00 elegia stip 
Q716wl trichopus s 
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3.7 


701 


2 


Q85Y10 


Q85yl0 


brassaiopsi 


1363 


101 . 5 


3.7 


703 


2 


Q85XZ4 


Q85xz4 


eleutheroco 


1364 


101 .5 


3.7 


704 


2 


Q85Y08 


Q85y08 


brassaiopsi 


1365 


101 . 5 


3.7 


705 


2 


Q85Y06 


Q85y06 


fatsia japo 


1366 


101 . 5 


3.7 


705 


2 


Q85Y13 


Q85yl3 


brassaiopsi 


1367 


101.5 


3.7 


707 


2 


Q85Y05 


Q85y05 


brassaiopsi 


1368 


101 . 5 


.3 . 7 


714 


2 


Q8MC77 


Q8mc77 


aralidium p 


1369 


101.5 


3.7 


725 


2 


Q8MC69 


Q8mc69 


aralia spin 


1370 


101 . 5 


3.7 


728 


2 


098702 


098702 


luculia gra 


1371 


101 . 5 


3.7 


732 


2 


Q9TJS5 


Q9t js5 


pseudomussa 


1372 


101 . 5 


3.7 


732 


2 


Q9TJY6... : . 


Q9tiy6 
* j j 


capirona de 


1373 


101 . 5 


3 . 7 


732 


2 


Q9TJZ4 


Q9t j z4 


alberta mag 


1374 


101.5 


3.7 


734 


2 


Q9THY8 


Q9thy8 


gardenia th 


1375 


101.5 


3.7 


741 


2 


Q9TL71 


Q9tl71 


eleutheroco 


1376 


101.5 


3.7 


748 


2 


Q9TL75 


Q9tl75 


angelica gi 


1377 


101.5 


3.7 


856 


2 


Q6MAY5 


Q6may5 


parachlamyd 


1378 


101 


3.6 


290 


2 


Q8D4B7 


Q8d4b7 


vibrio vuln 


1379 


101 


3.6 


339 


2 


Q6H8F8 


Q6h8f8 


oryza sativ 


1380 


101 


3.6 


340 


2 


Q7XW21 


Q7xw21 


oryza sativ 


1381 


101 


3.6 


386 


2 


Q73W10 


Q73wl0 


mycobacteri 


1382 


101 


3.6 


393 


2 


087480 


087480 


streptomyce 


1383 


101 


3.6 


420 


2 


Q8MEI2 


Q8mei2 


malva negle 



1384 


101 


3.6 


511 


1 


VAT TORMA 


1385 


101 


3.6 


641 


2 


Q7YM34 


1386 


101 


3.6 


687 


2 


Q6Y1X5 


1387 


101 


3.6 


693 


2 


Q32231 


1388 


101 


3.6 


695 


2 


Q6Y1U3 


1389 


101 


3.6 


697 


2 


003659 


1390 


101 


3.6 


697 


2 


Q95AT0 


1391 


101 


3.6 


698 


2 


Q95AQ7 


1392 


101 


3.6 


699 


2 


Q716V5 


1393 


101 


3.6 


742 


2 


Q9TLC0 


1394 


101 


3.6 


745 


2 


Q9TLC1 


1395 


101 


3.6 


1023 


2 


Q9XYD4 


1396 


100.5 


3.6 


370 


2 


Q97MF5 


1397 


100.5 


3.6 


455 


2 


Q9ALN7 


1398 


100.5 


3.6 


504 


2 


Q95GS7 


1399 


100.5 


3.6 


647 


2 


Q9GEU0 


1400 


100.5 


3.6 


717 


2 


Q8MC98 


1401 


100.5 


3.6 


720 


2 


Q8M9R5 


1402 


100.5 


3.6 


732 


2 


Q32055 


1403 


100.5 


3.6 


741 


2 


Q9TL69 


1404 


100.5 


3.6 


861 


1 


GCR3_YEAST 


1405 


100.5 


3.6 


1066 


2 


Q6CU73 


1406 


100 


3.6 


230 


2 


Q81PS7 


1407 


100 


3.6 


231 


2 


Q6HXZ4 


1408 


100 


3.6 


338 


2 


Q9MSP6 


1409 


100 


3.6 


415 


2 


Q7VGY0 


1410 


100 


3.6 


507 


2 


Q85XY9 


1411 


100 


3.6 


669 


2 


Q6JX92 


1412 


100 


3.6 


680 


2 


Q6Y1U7 


1413 


100 


3.6 


687 


2 


Q6Y1U8 


1414 


100 


3.6 


690 


2 


Q6Y1V1 


1415 


100 


3.6 


690 


2 


Q9BAS2 


1416 


100 


3.6 


693 


2 


Q85XZ7 


1417 


100 


3.6 


694 


2 


Q67GN4 


1418 


100 


3.6 


709 


2 


Q9BAR8 


1419 


100 


3.6 


1026 


2 


Q86T76 


1420 


100 


3.6 


1058 


2 


Q9PCW4 


1421 


100 


3.6 


2543 


2 


Q63CQ7 


1422 


100 


3.6 


3392 


2 


Q7ZA38 


1423 


99.5 


3.6 


348 


2 


Q87SS3 


1424 


99.5 


3.6 


370 


2 


Q93GK9 


1425 


99.5 


3.6 


371 


2 


Q8GH22 


1426 


99.5 


3.6 


392 


2 


Q86D27 


1427 


99.5 


3.6 


396 


2 


029653 


1428 


99.5 


3.6 


518 


2 


Q8D366 


1429 


99.5 


3.6 


622 


1 


PPCC_MOUSE 


1430 


99.5 


3.6 


622 


2 


Q8BSX3 


1431 


99.5 


3.6 


622 


2 


Q8CI37 


1432 


99.5 


3.6 


670 


2 


Q8HUN8 


1433 


99.5 


3.6 


679 


2 


Q8WGU3 


1434 


99.5 


3.6 


701 


2 


Q9SC21 


1435 


99.5 


3.6 


716 


2 


Q85UX6 


1436 


99.5 


3.6 


735 


2 


Q8WGU6 


1437 


99.5 


3.6 


1144 


2 


Q6MF68 


1438 


99 


3.6 


74 


2 


Q941P3 


1439 


99 


3.6 


190 


2 


062371 


1440 


99 


3.6 


343 


2 


Q73CT7 



Q91498 torpedo mar 
Q7ym34 zygogynum p 
Q6ylx5 alania endl 
Q32231 flagellaria 
Q6ylu3 aletris far 
003659 tetraclea c 
Q95at0 geonoma oxy 
Q95aq7 elaeis olei 
Q716v5 croomia jap 
Q9tlc0 tetraclea c 
Q9tlcl clerodendru 
Q9xyd4 dictyosteli 
Q97mf5 Clostridium 
Q9aln7 saccharopol 
Q95gs7 nepenthes n 
Q9geu0 samolus val 
Q8mc98 apium grave 
Q8m9r5 eremosyne p 
Q32055 campanula r 
Q9tl69 panax ginse 
P34160 saccharomyc 
Q6cu73 kluyveromyc 
Q81ps7 bacillus an 
Q6hxz4 bacillus an 
Q9msp6 nymphaea od 
Q7vgy0 helicobacte 
Q85xy9 peganum har 
Q6jx92 helicostyli 
Q6ylu7 halodule wr 
Q6ylu8 gymnostachy 
Q6ylvl xeronema ca 
Q9bas2 buxus hilde 
Q85xz7 dendropanax 
Q67gn4 blandf ordia 
Q9bar8 sarcococca 
Q86t76 homo sapien 
Q9pcw4 xylella fas 
Q63cq7 bacillus ce 
Q7za38 ashbya goss 
Q87ss3 vibrio para 
Q93gk9 klebsiella 
Q8gh22 escherichia 
Q86d27 entamoeba h 
029653 archaeoglob 
Q8d366 wiggleswort 
Q9z2v4 mus musculu 
Q8bsx3 mus musculu 
Q8ci37 mus musculu 
Q8hun8 ipheion dia 
Q8wgu3 pennantia c 
Q9sc21 sollya hete 
Q85ux6 brassaiopsi 
Q8wgu6 pennantia c 
Q6mf68 parachlamyd 
Q941p3 arachis hyp 
062371 caenorhabdi 
Q73ct7 bacillus ce 



1441 


99 


3.6 


371 


2 


Q6U5X7 


Q6u5x7 


klebsiella 


1442 


99 


3.6 


429 


2 


Q9GFH2 


Q9gfh2 


saururus ce 


1443 


99 


3.6 


439 


2 


Q62CH1 


Q62chl 


burkholderi 


1444 


99 


3.6 


493 


2 


Q8CWG5 


Q8cwg5 


yersinia pe 


1445 


99 


3.6 


504 


2 


Q95GS2 


Q95gs2 


nepenthes n 


1446 


99 


3.6 


612 


2 


Q6UQ74 


Q6uq74 


sinningia s 


1447 


99 


3.6 


627 


2 


Q7YM46 


Q7ym46 


cephalospha 


1448 


99 


3.6 


659 


2 


Q7YM36 


Q7ym36 


virola sebi 


1449 


99 


3.6 


665 


2 


Q9GGT9 


Q9ggt9 


dulichium a 


1450 


99 


3 . 6 


676 


2 


Q7YM4 0 


Q7ym40 


knema lauri 


1451 


99 


3 . 6 


67 6 


2 


Q7YM43 


Q7ym43 


gymnacranth 


1452 


99 


3.6 


680 


2 


Q8WI61 


Q8wi61 


ptychosperm 


1453 


99 


3.6 


680 


2 


Q7YM38 


Q7ym38 


myristica f 


1454 


99 


3. 6 


684 


2 


Q8HUP1 


Q8hupl 


asparagus f 


1455 


99 


3 . 6 


686 


2 


P92299 


P92299 


glechoma he 


1456 


99 


3.6 


68 6 


2 


P92329 


P92329 


plectranthu 


1457 


99 


3 . 6 


707 


2 


Q9GHZ4 


Q9ghz4 


chaenorhinu 


1458 


99 


3. 6 


707 


2 


Q9MVJ0 


Q9mvj0 


heritiera 1 


1459 


99 


3 . 6 


708 


2 


Q9GHW4 


Q9ghw4 


gambelia sp 


1460 

_L *1 \J \J 


99 


3.6 


738 


2 


019931 


019931 


tetrachondr 


1461 


99 


3 . 6 


1058 


2 


Q9VP94 


Q9vp94 


drosophila 


1462 


99 


3. 6 


2485 


2 


Q7RRE3 


Q7rre3 


p 1 a s mo d i um 


1463 


99 


3.6 


3392 


2 


Q75AD9 


Q7 5ad9 


ashbya goss 


1464 


98 . 5 


3.6 


371 


2 


Q67DV4 


Q67dv4 


shigella dy 


1465 


98 . 5 


3 . 6 


371 


2 


Q83WA1 


Q8 3wal 


escherichia 


1466 


98 . 5 


3 . 6 


383 


2 


Q7WC49 


Q7wc49 


bordetella 


1467 


98 . 5 


3 . 6 


606 


2 


Q6FRS6 


Q6frs6 


Candida gla 


1468 


98. 5 


3.6 


700 


2 


Q9TLF5 


Q9tlf5 


globularia 


1469 


98 . 5 


3 . 6 


732 


2 


Q9T JY9 


Q9ti v9 


calycophyll 


1470 


98 . 5 


3.6 


733 


2 


019941 


019941 


globularia 


1471 


98 . 5 


3 . 6 


942 


2 


P74081 


P74081 


synechocyst 


1472 


98 . 5 


3 . 6 


1049 


1 


VP39 YEAST 


Q07468 


saccharomyc 


1473 


98 . 5 


3. 6 


1656 


2 


Q7R489 


Q7r489 


giardia lam 


1474 


98 . 5 


3.6 


3645 


2 


Q750B4 


Q750b4 


ashbya goss 


1475 


98 


3 . 5 


230 


2 


Q6KS13 


Q6ksl3 


bacillus an 


1476 


98 


3 . 5 


307 


2 


Q7XXJ7 


Q7xxj7 


oryza sativ 


1477 


98 


3 . 5 


343 


2 


Q81UI1 


Q81uil 


bacillus an 


1478 


98 


3.5 


346 


2 


Q6I2Q8 


Q6i2q8 


bacillus an 


1479 


98 


3.5 


373 


2 


• Q8KUL2 


Q8kul2 


streptococc 


1480 


98 


3 . 5 


379 


2 


Q8NLS7 


Q8nls7 


corynebacte 


1481 


98 


3.5 


382 


2 


Q9ZA43 


Q9za43 


streptomyce 


1482 


98 


3 . 5 


391 


1 


CAP J STAAU 


P39859 


staphylococ 


1483 


98 


3.5 


408 


2 


Q73TF6 


Q73tf6 


mycobacteri 


1484 


98 


3 . 5 


442 


2 


Q53881 


Q53881 


streptomyce 


1485 


98 


3 . 5 


444 


2 


Q8F9G4 


Q8f 9g4 


leptospira 


1486 


. - 98 


3 . 5 


543 


2 


Q7YXZ2 


Q7yxz2 


cryptospori 


1487 


98 


3.5 


622 


1 


PPCC CHICK 


P05153 


gallus gall 


1488 


98 


3 . 5 


623 


2 


Q7YM41 


Q7ym41 


iryanthera 


1489 


98 


3.5 


636 


2 


Q716U7 


Q716u7 


stichoneuro 


1490 


98 


3.5 


667 


2 


Q6JX80 


Q6jx80 


perebea gui 


1491 


98 


3.5 


669 


2 


Q8HUN9 


Q8hun9 


agapanthus 


1492 


98 


3.5 


67 0 


2 


Q6JXA8 


Q6jxa8 


batocarpus 


1493 


98 


3.5 


671 


2 


Q8HUN6 


Q8hun6 


narcissus e 


1494 


98 


3.5 


672 


2 


Q95AT4 


Q95at4 


areca vesti 


1495 


98 


3.5 


686 


2 


Q6Y1V2 


Q6ylv2 


tecophilaea 


1496 


98 


3.5 


688 


2 


Q6Y1V9 


Q6ylv9 


dracaena au 


1497 


98 


3.5 


690 


2 


Q6Y1W6 


Q6ylw6 


hypoxis jun 



1498 


98 


3.5 


692 


2 


Q67GN8 


Q67gn8 


roystonea 


P 


1499 


98 


3.5 . 


- 694 


2 


047200 


047200 


narcissus 


e 


1500 


98 


3.5 


697 


2 


Q95AR5 


Q95ar5 


roystonea 


o 



ALIGNMENTS 



RESULT 1 
Q6UXC4 

ID Q6UXC4 PRELIMINARY; PRT; 523 AA. 

AC Q6UXC4; 

DT 05-JUL-2004 (TrEMBLrel. 27, Created) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update) 

DE Glucuronosyltransf erase. 

GN ORFNames=UNQ842 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22887296; PubMed=12975309 ; DOI=10 . 1101/gr . 1293003 ; 

RA Clark H. F. , Gurney A. L . , Abaya E., Baker K. , Baldwin D., Brush J., 

RA Chen J. , Chow B. , Chui C. , Crowley C, Currell B., Deuel B., Dowd P . , 

RA Eaton D., Foster J., Grimaldi C, Gu Q., Hass P.E., Heldens S., 

RA Huang A., Kim H.S., Klimowski L., Jin Y. , Johnson S., Lee J., 

RA Lewis L. , Liao D., Mark M., Robbie E., Sanchez C, Schoenfeld J., 

RA Seshagiri S., Simmons L., Singh J., Smith V., Stinson J. , Vagts A., 

RA Vandlen R. , Watanabe C, Wieand D., Woods K. , Xie M. H . , Yansura D., 

RA Yi S., Yu G., Yuan J., Zhang M., Zhang Z., Goddard A., Wood W.I., 

RA Godowski P. ; 

RT "The secreted protein discovery initiative (SPDI), a large-scale 

RT effort to identify novel human secreted and transmembrane proteins: a 

RT bioinformatics assessment."; 

RL Genome Res. 13:2265-2270(2003). 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

DR EMBL; AY358416; AAQ88782.1; 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Transferase. 

SQ SEQUENCE . 523 AA; 59580 MW; 5DE9CD92D1FBA18F CRC64 ; 

Query Match 100.0%; Score 2768; DB 2; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.7e-210; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 



Qy 



61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 
I I I I II I I I I I I I II I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I 



Db 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 


Qy 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 




1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 




Db 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 


Qv 


181 


LSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


LSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 


Qv 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVXVTL 


300 


Qv 


301 


GSMVOTCQNPEIFKEMNNAFAHLPQGVIWKC^^ 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GSMVNTCQNPEIFKE>INNAFAHLPQGVIWKCQCSH^ 


360 


Ov 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 






1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 




Db 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 


Qv 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 


Qy 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 




Db 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 





RESULT 2 
Q8NBP2 



ID Q8NBP2 PRELIMINARY; PRT; 523 AA. 

AC Q8NBP2; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24 , Last annotation update) 

DE Hypothetical protein PSEC0073. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia;. Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Ota T . , Nishikawa .T. , Suzuki Y., Kawai-Hio Y., Hayashi K., Ishii S., 

RA Saito K., Yamamoto J., Wakamatsu A., Nagai T., Nakamura Y:, 

RA Nagahari K., Sugano S., Isogai T.; 

RL Submitted (MAR-2002) to the EMBL/GenBank/DDB J databases. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

DR EMBL; AK075383; BAC11583.1; -. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Transferase. 



SQ SEQUENCE 523 AA; 



59618 MW; 89928A26AB9C09F5 CRC64; 



Query Match 99.5%; Score 2754; DB 2; Length 523; 

Best. Local Similarity 99.6%; Pred. No. 8.5e-209; 

Matches 521; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy 


1 


MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 






1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 




Db 


1 


MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 


Qy 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 






1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

I I I I I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ ■ 1 ■ 1 1 




Db 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 


Qy 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 


Qy 


181 


LSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 

i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ' 




nh 


181 


LSWPVFRSLLTDHMDFWGRVTCNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 


Qy 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVXVTL 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II 




nh 


241 


LL KAE LW F I N S D FAFD FAR P L L PNTVYVGGLME K P I K P VP QDL EN F I AK FE D S G FVLVT L 


300 


Qy 


301 


GSMVTTTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPkDV^ 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
i i i i i i t t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ■ > ■ ■ > ■ > ■ i 




Db 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVlilAAN 


360 


Qy 


361 


PS I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAETL 


420 






1 I I I I I 1 1 I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


PS I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAETL 


420 


Qy 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 




Db 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 


Qy 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 








Mill 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


HEQYLLDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 





RESULT 3 
Q6NUS8 



ID Q6NUS8 PRELIMINARY; PRT.;, 523 AA. 

AC Q6NUS8; 

DT 05-JUL-2004 (TrEMBLrel. 27, Created) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update) 

DT 05-JUL-2004 (TrEMBLrel. 21, Last annotation update) 

DE Hypothetical protein FLJ34658. 

GN Name=FLJ34658; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 



RP SEQUENCE FROM N.A. 

RC TISSUE=Testis; 

RX MEDLINE=22388257; PubMed=12477932 r DOI=10 . 1073/pnas . 242603899 ; 

RA Strausberg R. L . , Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenitien CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B. , Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T . E. , 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P . , Prange C.\ 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez. A. C. , Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Testis; 

RA Strausberg R. ; 

RL Submitted (APR-2004) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

DR EMBL; BC068446; AAH68446.1; -. 

DR GO; GO:0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Hypothetical protein; Transferase. 

SQ SEQUENCE 523 AA; 59150 MW; 602AE3D75CE986BD CRC64; 



Query Match 79.4%; Score 2198; DB 2; Length 523; 

Best Local Similarity 78.8%; Pred. No. 7.1e-165; 

Matches 413; Conservative 46; Mismatches 63; Indels 2; Gaps 2; 



Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I II I I I III I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I : I I I I I I I I I : 
Db 1 MVGQRVLLLVAFLLSGVLLSEAAKILTISTLGGSHYLLLDRVSQILQEHGHNVTML-HQS 59 

Qy 61 GPFM-PDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLAL 119 

I I : I I I : I I II I I I I I : I I I I I : I I I I : : I I I I : I I : : : I 
Db 60 GKFLI PDI KEEEKS YQVIRWFS PEDHQKRI KKHFDS YI ETALDGRKESEALVKLMEI FGT 119 

Qy 120 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

III: I : I I I I I I I I I I I I : I : I II I I : I II I I I I I I I I I I I I I : I I I I : I I I I 
Db 120 QCSYLLSRKDIMDSLKNENYDLVFVEAFDFCSFLIAEKLVKPFVAILPTTFGSLDFGLPS 179 

Qy 180 PLSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSH 239 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 180 PLSYVPVFPSLLTDHMDFWGRVKNFLMFFSFSRSQWDMQSTFDNTIKEHFPEGSRPVLSH 239 



Qy 240 LLLKAELWFINSDFAFDFARPLLPNTVWGGLMEKPIKPVPQDLENFIAKFGDSGFVLVT 299 

I M I I I I I I : I I I I I I I I I I I I I I II I I : I II I I I I I I I I I I I I : I I I I I I I : I I I I I 
Db 240 LLLKAELWFVNSDFAFDFARPLLPNTVYIGGLMEKPIKPVPQDLDNFIANFGDAGFVLVA 299 

Qy 300 LGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSD 359 

111:11 I : I : I : I : I I I I I I I I I I I I II I I I I : I I I I I I I I I I I I I I I I I I I I 
Db 300 FGSMLNTHQSQEVLKK^4HNAFAHLPQGVIWTCQSSHWPRDVHIAT^^^IVDWLPQSDLI^ 359 

Qy 360 HPS I RLFVTHGGQNSIMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAET 419 

I I I I I I I I I I I I I I I : I I I I : I I I I I I I : I : III I I I I I II : I I I I : I : : I : I 
Db 360 HPS I RLFVTHGGQNSVMEAI RHGVPMVGLPWGDQHGNMVTWVAiCNYGVS I RLNQVTADT 419 

Qy 420 LALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVXQTGGATHLKPYVFQQP 479 

I I I I I :: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I 
Db 420 LTLTMKQVIEDKRYKSAWAASVILHSQPLSPAQRLVGWIDHILQTGGATHLKPYAFQQP 479 

Qy 480 WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I : I I I I I I I I : 111111111:1 
Db 480 WHEQYLIDVFVFLLGLTLGTMWLCGKLLGWARWLRGARKVKKT 523 



RESULT 4 
Q96DM6 

ID Q96DM6 PRELIMINARY; PRT; 523 AA. 

AC Q96DM6; 

DT 01-DEO2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein FLJ32504. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Small intestine; 

RX PubMed=14702039; DOI=10 . 1038/ngl285; 

RA Ota T., Suzuki Y., Nishikawa T. f Otsuki T. , Sugiyama T., Irie R. , 

RA Wakamatsu A. , Hayashi K. , Sato H., Nagai K. , Kimura K., Makita H., 

RA Sekine M., Obayashi M. , Nishi T., Shibahara T., Tanaka T . , Ishii S., 

RA Yamamoto J., Saito K. , Kawai Y., Isono Y., Nakamura Y., Nagahari K., 

RA Murakami K. , Yasuda T., Iwayanagi T., Wagatsuma M. , Shiratori A. , 

RA Sudo H., Hosoiri T., Kaku Y., Kodaira H. , Kondo H., Sugawara M. , 

RA Takahashi M. , Kanda K. , Yokoi T., Furuya T., Kikkawa E., Omura Y., 

RA Abe K., Kamihara K., Katsuta N., Sato K.,. Tanikawa M. , Yamazaki M. , 

RA Ninomiya K. , Ishibashi T., Yamashita H. , Murakawa K. , Fujimori K., 

RA Tanai H., Kimata M. , Watanabe M. , Hiraoka S w Chiba Y., Ishida S., 

RA Ono Y., Takiguchi S. f Watanabe S., Yosida M. , Hotuta T. , Kusano J. , 

RA Kanehori K., Takahashi-Fuj ii A., Hara H., Tanase T., Nomura Y., 

RA Togiya S., Komai F. , Hara R. , Takeuchi K. , Arita M. , Imose N . , 

RA Musashino K., Yuuki H., Oshima A. , Sasaki N., Aotsuka S., 

RA Yoshikawa Y., Matsunawa H. f Ichihara T. f Shiohata N. , Sano S., '. 

RA Moriya S., Momiyama H. f Satoh N., Takami S. f Terashima Y. , Suzuki O., 

RA Nakagawa S., Senoh A. f Mizoguchi H. , Goto Y., Shimizu F., Wakebe H. f 

RA Hishigaki H., Watanabe T., Sugiyama A. , Takemoto M., Kawakami B. , 

RA Yamazaki M. , Watanabe K., Kumagai A. , Itakura S., Fukuzumi Y., 



RA Fujimori Y., Komiyama M. , Tashiro H. , Tanigami A., Fujiwara T., 

RA Ono T. , Yamada.K., Fujii Y. , Ozaki K., Hirao M. , Ohmori Y., 

RA Kawabata A. , Hikiji T. , Kobatake N. , Inagaki H., Ikema Y., Okamoto S., 

RA Okitani R. , Kawakami T., Noguchi S., Itoh T. , Shigeta K. , Senba T., 

RA Matsumura K. , Nakajima Y. , Mizuno T., Morinaga M. , Sasaki M., 

RA Togashi T., Oyama M. , Hata H., Watanabe M. , Komatsu T., 

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K. , 

RA Okumura K. , Nagase T. , Nomura N., Kikuchi H., Masuho Y. , Yamashita R. , 

RA Nakai K., Yada T., Nakamura Y., Ohara 0., Isogai T., Sugano S.; 

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs . " ; 

RL Nat. Genet. 36:40-45(2004). 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

DR EMBL; AK057066; BAB71358.1; -. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Transferase. 

SQ SEQUENCE 523 AA; 59148 MW; 2034D6E90863EA8E CRC64; 



Query Match 79.2%; Score 2193; DB 2; Length 523; 

Best Local Similarity 78.8%; Pred. No. 1.8e-164; 

Matches 413; Conservative 45; Mismatches 64; Indels 2; Gaps 2; 



Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I III I I I I I I I I I I I I I I I : I I I I I I I : I I I I I I I I : I I I I I I I I I : 
Db 1 MVGQRVLLLVAFLLSGVLLSEAAKILTISTLGGSHYLLLDRVSQILQEHGHNVTML-HQS 59 

Qy 61 GPFM-PDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLAL 119 

I I : I I I : I I I I II I I I : I I I I I : I I I I : : I I I I : I I : : : I 
Db 60 GKFLI PDI KEEEKS YQVI RWFS PEDHQKRI KKHFDS YI ETALDGRKESEIALVKLMEI FGT 119 

Qy 120 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

III: 1:111111111111 I : I II I I : I I I I I I I I I I I I I I I I : I I I I : I I I I 
Db 120 QCSYLLSRKDIMDSLKNENCDLVFVEAFDFCSFLIAEKLVKPFVAILPTTFGSLDFGLPS 179 

Qy 180 PLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSH 239 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 180 PLSWPVFPSLLTDHMDFWGRVKNFLMFFSFSRSQWDMQSTFDNTIKEHFPEGSRPVLSH 239 

Qy 240 LLLKAELWFINSDFAFDFARPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVT 299 

I I I I I I I I I : I I I I I I I I I I I I I I I II I : I I I I I I I I I I I I I I I : II I I I I I : I I I I I 
Db 240 LLLKAELWFVNSDFAFDFARPLLPNTVYIGGLMEKPIKPVPQDLDNFIANFGDAGFVLVA 299 

Qy 300 LGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDV^ 359 

111:11 I : I : I : I : I I I I I I I I I I I I II I I I I : I I I I I I I I I I I I I I I I I I I I 
Db 300 FGSMLNTHQSQEV^KKMHNAFAHLPQGVIWTCQSSHWPRDVTiLATNVKIVDWLPQSDLLA 359 

Qy 360 HPSI RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAET 419 

I I I I I I I I I I I I I I I : I I I : I I I I I I I : I : I I I I I I I I I I : I I I I : I : : I : I 
Db 360 H P S I RL FVT H GGQN S VMET I RHGVPMVG L P VN GDQH GNMVRWAKN YGVS I RLNQVTADT 419 

Qy 420 LALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQP 479 

I I I I I :: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 420 LTLTMKQVIEDKRYKSAWAASVILHSQPLSPAQRLVGWIDHILQTGGATHLKPYVFQQP 479 



Qy 480 WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I : I I I I I I I I : I I II I I I I I : I 
Db 480 WHEQYLIDVFVFLLGLTLGTMWLCGKLLGWARWLRGARKVKKT 523 



RESULT 5 
Q8R0Y5 

ID Q8R0Y5 PRELIMINARY; PRT; 523 AA. 

AC Q8R0Y5; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Expressed sequence AI746432. 

GN Name=AI746432; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxI D=l 0 0 9 0 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=FVB/N; TISSUE=Liver ; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas". 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R. F. , Jordan H . , Moore T., Max S.I.,. Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan.P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=FVB/N; TISSUE=Liver ; 

RA Strausberg R. ; 

RL Submitted (MAR-2002) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

DR EMBL; BC025940; AAH25940.1; 

DR MGD; MGI:2146055; AI746432. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P : metabolism; IEA. 

DR InterPro; IPR002213; UDP_glucos__trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Transferase. 



SQ SEQUENCE 523 AA; 59742 MW; F2EF34F41C3DEB95 CRC64; 



Query Match 65.6%; Score 1815; DB 2; Length 523; 

Best Local Similarity 65.4%; Pred. No. 1.3e-134; 

Matches 342; Conservative 73; Mismatches 106; Indels 2; Gaps 2; 



Qy 1 MAGQRVXLLVGFLLPGVXLSE1AAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

II I I I I I I I I I I I I I I I I I I I : I II : I I : I I I I I I I I I : I I : : 
Db 1 MAAHRSWLLVSFFLLEVLLLEAAKILTISTLSASHYILMNRVSQILQGGGHDVIKLLYEG 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I : I I I : I I I I I I I : I I I I I : I : : : : I III: I I : : I I 
Db 61 GD-IPDFRKENSSYQVINWRLPEDQQKTFENRWHRLIDEYAYGRSKYHTLLKIHQYFADL 119 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLP-I 179 

I I I I : I I I I I : I : I I I I : I :::: I I I I I I I I II I I : I I : : I I I I 
Db 120 CSHLLSRKDIMELLQKENFDLVLLDSMDLCSFLIVEKLGKRFVSFLPFQFSYMDFGLPNA 179 

Qy 180 PLSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSH 239 

I I I I I I : I I I I I I I I I I I I I I I I I I ::::: I : I I :: I I I I I I : I I I I 
Db 180 PLSYAPWGSGLTDQMDFWGRVKNILMFFHFTKKRRDIFSQYGNTVQEHFAEGSQPVLSD 239 

Qy 240 LLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVT 299 

I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I :: I I :: I : I I I I I : I I :: I I I I I I I I I 
Db 240 LLLKAELWFVNSDFALDFARPLFPNTVYVGGLLDKPVQPIPQDLEDFISQFGDSGFVLVA 299 

Qy 300 LGSMWTCQNPEIFKEMNNAFAHLPQGVTWKCQCSHWPKDV^ 359 

I I : I : I : II I I I I : I I I I I I I I I : I I : I I I I I I I I I I I I I : I I I I I I I I I 
Db 300 LDS WSMIQSKEI I KEMNSAFAHLPQGVXWTCKS SHWPKDVSLAPNVKIMDWLPQI DLLA 359 

Qy 360 HPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMWVT^ 419 

I I I I I I I I I I I I I I : I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : INI: 

Db 360 HPSIRLFvTHGGMNSVMEAVTIHGVPMVGIPFFGDQPEN^ 419 

Qy 420 IALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQP 479 

I ||:::||:|||:||:|: II I I I I : I I I I I I I I I I : I I I I I I Mill MM 
Db 420 FLLTMKEVIEDQRYKTAAMASKVI RNSHPLTPAQRLVGWI DHI LQTGGAAHLKPYAFQQP 479 

Qy 480 WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKE 522 

II II I: I I I : I I I II II I II II I : I : I MUM 

Db 480 WHEQYMLDVFLFLLGLTLGTLWLSVKVXVAVTRYLSISRKVKQ 522 



RESULT 6 
Q8JZZ0 

ID Q8JZZ0 PRELIMINARY; PRT; 523 AA. 

AC Q8JZZ0; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein AI313915. 

GN Name=AI313915; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 



RP SEQUENCE FROM N.A. 

RC STRAIN=FVB/N; TISSUE=Liver ; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A. , Grouse L . H . , Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N. K. , 

RA Hopkins R.F., Jordan H. f Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M. F. , Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P. , Prange C, 

RA Raha S.S., Loquellano N . A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E. , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butter field Y.S., 

RA Krzywinski M.I., Skalska U. , Smailus D.E., Schnerch A., Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=FVB/N; TISSUE=Liver ; 

RA Strausberg R. ; 

RL Submitted (JUL-2002) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltrans f erase family. 

DR EMBL; BC034837; AAH34837.1; 

DR MGD; MGI:2145969; AI313915. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P:metabolism; IEA. 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Hypothetical protein; Transferase. 

SQ SEQUENCE 523 AA; 59662 MW; 4F7BD5ACBAFB5127 CRC64; 



Query Match 65.2%; Score 1804; DB 2; Length 523; 

Best Local Similarity 65.4%; Pred. No. 9.4e-134; 

Matches 342; Conservative 69; Mismatches 110; Indels 2; Gaps 2; 

Qy 1- MAGQRVXLLVGFLLPGVXLSE1AAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

II I I I : I I I : I I I I I I I I I I I : I I I : : : I I I I : I : I I I I I I : : 
Db 1 MAAHRRWLLMS FLFLEVI LLEAAKI LTI STLSASH YI VI SRVSQVLHEGGHNVTKLLYES 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

: I I I : I I : I I I I I : I I I I : : : I II III I I : : I 

Db 61 AN- I PDFRKEKPS YQVINWRPPEDQEKKFADLRHRLTEEITYGRSKHHTLLKIHQYFGDL 119 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLP-I 179 

II I : I I I I I I I I I I I I I : I : : : : I I I I I I I I I I I : I I : : I I I I 
Db 120 CSQLLSRKDIMDFLKNENFDLVLLDSMDLCSLLIVEKLGKRFVSFLPFQFSYMDFGLPSA 179 

Qy 180 P L S YVP VFRS LLT DHMD FWGRVKN FLMFFS FCRRQQHMQ S T FDNT I KEH FT EG S RPVL S H 239 

I I I I I I : I I I I I I I I I I I I I I I I I I : I : : I : I : I I : I I I I I I : I I I I 



Db 


180 


PLSYAPVYGSGLTDQMDFWGRVKNFLMFLDFSMKQREILSQYDSTIQEHFVEGSQPVLSD 


239 


Qy 


240 


LLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVT 


299 






1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 :: 1 1 :: 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


240 


LLLKAELWFVNSDFALDFARPLFPNTVYVGGLLDKPVQPIPQDLENFISQFGDSGFVLVA 299 


Qy 


300 


LGSIWNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDL^ 


359 






1 1 1 : 1 : 1 : II 1 1 1 1 : 1 1 1 M 1 1 1 1 : 1 1 : 1 1 1 1 1 1 1 1 1 : 1 1 1 1 : 1 1 1 1 1 : 1 1 1 1 




Db 


300 


LGSIVSMIQSKEIIKEMNSAFAHLPQGVLWTCKTSHWPKDVSLASNVKIMDWLPQTDLLA 359 


QV 


360 


HPS I RLFVTHGGQNSIMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAET 


419 






1 1 1 1 1 1 1 1 1 II 1 Ihlll: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINI: MM: 




Db 


360 


H P S I RL FVTH GGMN S VMEAVHH GVPMVG IPFFFDQP ENMVRVEAKN L G VS I Q LQT L KAE S 


419 


Qv 


420 


LALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQP 


479 






II 1 1 : 1 : 1 1 1 1 1 1 1 1 1 : 1 : : 1 1 1 II : 1 1 1 1 : 1 1 1 1 1 : 1 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


420 


FALTMKKIIEDKRYKSAAMASKIIRHSHPLTPAQRLLGWIDHILQTGGAAHLKPYAFQQP 


479 


Qy 


480 


WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 








1 II 1 1 : 1 1 1 : 1 1 1 1 1 1 1 II 1 1 1 : 1 : 1 1 1 1 1 1 




Db 


480 


WHEQYMLDVFLFLLGLMLGTLWLSVKVLVAWRYLSIATKVKE 522 





RESULT 7 
Q8VC11 

ID Q8VC11 PRELIMINARY; PRT; 523 AA. 

AC Q8VC11; 

DT 01-MAR-2002 (TrEMBLrel. 20, Created) 

DT 01-MAR-2002 (TrEMBLrel. 20, Last sequence update) 

DT 25-OCT-2004 (TrEMBLrel. 28, Last annotation update) 

DE Hypothetical protein AI313915 (Mus musculus adult male liver tumor 

DE cDNA, RIKEN full-length enriched library, clone : C730018P11 

DE product : hypothetical UDP-glucoronosyl and UDP-glucosyl transferase 

DE containing protein, full insert sequence) . 

GN Name=AI313915; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=FVB/N; TISSUE=Liver ; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow.K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M., Hong L . , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 



RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A. , Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=FVB/N; TISSUE=Liver ; 

RA Strausberg R. ; 

RL Submitted (JAN-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Liver ; 

RX MEDLINE=99279253; PubMed=10349636; DOI=10 . 1016/S0076-6879 (99) 03004-9; 

RA Carninci P., Hayashizaki Y.; 

RT "High-efficiency full-length cDNA cloning."; 

RL Meth. Enzymol. 303:19-44(1999). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Liver ; 

RX MEDLINE=21085660; PubMed=11217851 ; DOI=10 . 1038/35055500 ; 

RA RIKEN FANTOM Consortium; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

RN [5] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Liver ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

RN [6] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Liver ; 

RX MEDLINE=20499374; PubMed=11042159 ; DOI=10 . 1101/gr . 145100; 

RA Carninci P . , Shibata Y., Hayatsu N., Sugahara Y. f Shibata K. , Itoh M. , 

RA Konno H., Okazaki Y., Muramatsu M. , Hayashizaki Y. ; 

RT "Normalization and subtraction of cap-trapper-selected cDNAs to 

RT prepare full-length cDNA libraries for rapid discovery of new genes."; 

RL Genome Res. 10:1617-1630(2000). 

RN [7] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Liver ; 

RX MEDLINE=20530913; PubMed=11076861 ; DOI=10 . 1101/gr . 152600 ; 

RA Shibata K., Itoh M. , Aizawa K. , Nagaoka S., Sasaki N., Carninci P., 

RA Konno H., Akiyama J., Nishi K. , Kitsunai T., Tashiro H., Itoh M. , 

RA Sumi N., Ishii Y., Nakamura S., Hazama M. , Nishine T., Harada A., 

RA Yamamoto R. , Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., 

RA Fujiwake S. f Inoue K., Togawa Y., Izawa M. , Ohara E., Watahiki M. , 

RA Yoneda Y., Ishikawa T. , Ozawa K., Tanaka T., Matsuura S., Kawai J., 

RA Okazaki Y. , Muramatsu M. , Inoue Y. , Kira A., Hayashizaki Yv; 

RT "RIKEN integrated sequence analysis (RISA) system-384-f ormat 

RT sequencing pipeline with 384 multicapillary sequencer."; 

RL Genome Res. 10:1757-1771(2000). 

RN [8] 



RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Liver ; 

RA Adachi J., Aizawa K., Akimura T., Arakawa T. , Bono H., Carninci P., 

RA Fukuda S., Furuno M. , Hanagaki T., Hara A., Hashizume W., 

RA Hayashida K. , Hayatsu N., Hiramoto K. , Hiraoka T., Hirozane T., 

RA Hori F., Imotani K. , Ishii Y. , Itoh M. , Kagawa I., Kasukawa T., 

RA Katoh H., Kawai J., Kojima Y., Kondo S., Konno H., Kouda M. , Koya S., 

RA Kurihara C. , Matsuyama T., Miyazaki A. , Murata M. , Nakamura M. , 

RA Nishi K., Nomura K., Numazaki R. , Ohno M. , Ohsato N. , Okazaki Y. , 

RA Saito R. , Saitoh H., Sakai C, Sakai K., Sakazume N., Sano H., 

RA Sasaki D . , Shibata K., Shinagawa A., Shiraki T., Sogabe Y., Tagami M. , 

RA Tagawa A., Takahashi F. , Takaku-Akahira S., Takeda Y., Tanaka T., 

RA Tomaru A., Toya T., Yasunishi A., Muramatsu M. , Hayashizaki Y. ; 

RL Submitted (JUL-2001) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

DR EMBL; BC022134; AAH22134.1; -. 

DR EMBL; AK050128; BAC34080.1; -. 

DR MGD; MGI:2145969; AI313915. 

DR GO; GO:0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransferase; Hypothetical protein; Transferase. 

SQ SEQUENCE 523 AA; . 59672 MW; BC7BD6ADF197ADD9 CRC64; 

Query Match 65.1%; Score 1802; DB 2; Length 523; 

Best Local Similarity 65.4%; Pred. No. 1.4e-133; 

Matches 342; Conservative 68; Mismatches 111; Indels 2; Gaps 2; 
Qy 1 MAGQRV^LLVGFLLPGVXLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 



Db 



M i i i . ii t-i i I i i i i I i i i ■ i i I • • • iii i — i • i i i i i i 
1 MAAHRRWLLMS FLFLEVILLEAAKI LTI STLSASHYIVI SRVSQVLHEGGHNVTKLLYES 60 



Qy 



Db 



61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

: I I I : I I : I I I I I : I I I I :: : I II III I I : : I 

61 AN-IPDFRKEKPSYQVINWRPPEDQEKKFADLRHRLTEEITYGRSKHHTLLKIHQYFGDL 119 



Qy 



121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLP-I 179 



II I • II II II M i I M I • I I I I I . I I I I I II* I 1 • • I I I I 

120 CSQLLSRKDIMDFLKNENFDLVLLDSMDLCSLLIVEKLGKRFVSFLPFQFSYMDFGLPSA 179 




Db 



Qy 



Db 



180 PLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSH 239 

I I I I II: I III I I I I I I I I I I I I I I : I : : I : I : I I : I I I I I I : I I I I 
180 PLSYAPWGSGLTDQMDFWGRVKNFLMFLDFSMKQREILSQYDSTIQEHFVEGSQPVLSD 239 



Qy 



240 LLLKAELWFINSDFAFDFARPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVT 299 





Db 



240 LLLKAELWFVNSDFALDFARPLFPNTVYVGGLLDKPVQPIPQDLENFISQFGDSGFVLVA 299 



Qy 



Db 



300 LGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVTiLAANVT<IvT)WLPQSDL^ 359 

I I I : I : I : II I I I I : I I I I I I I I I : I I : I I I I I I I II I I I I : I I I I I : I I I I 
300 LGSIVSMIQSKEIIKEMNSAFAHLPQGVXWTCKTSHWPKDVSLAPNVKIMDWLPQTDLLA 359 



Qy 
Db 



360 
360 



HPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMWVTj^KFGVSIQLKKLKAET 419 

I II I I I I I I I I I Ihlll: I I I I II I I I I I I I I I I I I I I I I I I I II I : MM: 
HPSIRLFVTHGGMNSVTyiEAVHHGVPMVGIPFFFDQPENMVRVEAKNLGVSIQLQTLKAES 419 



Qy 420 LALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQP 479 

I I I I : I. : I I I I I I I I I : I :: I I I I I : I I I I : I I I I I : I I I I I I I I I I I I I I I 
Db 420 FALTMKKIIEDKRYKSAAMASKIIRHSHPLTPAQRLLGWIDHILQTGGAAHLKPYAFQQP 479 

Qy 4 80 WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

I I I I I : I I I : I I I I I I I I I I I I : I : I I I I I I 

Db 4 80 WHEQ YMLDVFL FLLGLMLGT LWL S VKVLVAVT R YL S I AT KVKE 522 



RESULT 8 
Q63ZR6 

ID Q63ZR6 PRELIMINARY; PRT; 523 AA. 

AC Q63ZR6; 

DT 25-OCT-2004 (TrEMBLrel. 28, Created) 

DT 25-OCT-2004 (TrEMBLrel. 28, Last sequence update) 

DT 25-OCT-2004 (TrEMBLrel. 28, Last annotation update) 

DE Hypothetical protein. 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC T I S SUE=Ova r y ; 

RX MEDLINE=22341132; PubMed=12454 917 ; DOI=10 . 1002/dvdy. 10174 ; 

RA Klein S.L., Strausberg R.L., Wagner L., Pontius J., Clifton S.W., 

RA Richardson P.; 

RT "Genetic and genomic tools for Xenopus research: The NIH Xenopus 

RT initiative."; 

RL Dev. Dyn. 225:384-391(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX PubMed=12477932; DOI=10 . 1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L. , Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M. F. , Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon. D.K. , Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J. , Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [3] 

RP SEQUENCE FROM N.A. 



RC TISSUE=Ovary; 

RA Klein S., Gerhard D.S.; 

RL Submitted (SEP-2004) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC082844; AAH82844.1; 

KW Hypothetical protein. 

SQ SEQUENCE 523 AA; 60436 MW; 73B728B7D1CE5CA0 CRC64; 

Query Match 41.4%; Score 1145; DB 2; Length 523; 

Best Local Similarity 42.4%; Pred. No. l.le-81; 

Matches 224; Conservative 111; Mismatches 181; Indels 12; Gaps 4; 

Qy 1 MAGQRVLLLVGFLL PGVLLS EAAKI LT I STVGGSH YLLMDRVSQI LQDHGHNVTMLNHKR 60 

II I I : : I : : I INN: : I I I I I I I I : I : I I : : I III 
Db 1 MAVGRKSLILSLLIQHFVLLHGAKILTVCFLGGSHYLWMDEISRILHNNGQEVTMFLQIA 60 

Qy 61 GPFMPDFKKEEKSYQVTSWLAPEDHQREFKKSFDFFLEETLGGR — GKFENLLNVLEYLA 118 

: | | : : : | I : : I : I : : : : I I : I I : : : : | : : : : 

Db 61 DGLLPDYQMQESPYRLITWSLDKNYLKEFS EFFRDSKYNFKDCDELSSYLGLMTHFS 117 

Qy 119 LQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAI LSTSFGSLE 174 

II I : I I : I I I : I : : : : : I : I I I : : I I I I I I : I : : : I 

Db 118 RQCKMIFNQTSIMNLLKEEKYDLAVIDSFNPCTFLVSEKLGIPFIATHPFPVKSPWHS— 175 

Qy 175 FGLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSR 234 

I : I I I I : I I : : I I I I I I I I : I I I I I : : : : : I I I : I : I I I II 
Db 176 -GIPNQLSYMPWQSQLTDHMDFFERVT^NVFMYIASAVLERKIYSLFDDVIEEHFPACSR 234 

Qy 235 PVXSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSG 294 

I I I II : II : I I I I : I : I I :: I I I I I : : I I : I I I : I : I 
Db 235 PSFEELYKKTALWMYLTDFTIEFPHPFFPNVLYIGGVLAKPAKPVSEELEDFIAQSGEHG 294 

Qy 295 FVXVTLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQC 354 

I : : I I I I II : I I I I I : I : : I I I I I : : I I I I : I I I I I I : : I : I 

Qb 295 FIIWFGSMVPSNPLTEFV7<EMNDGFSKIPQKVIWRYRISEWPKVXQLA 354 

Qy 355 SDLLAHPSIRLFWHGGQNSIMEAIQHGVPMVGIPLFGDQPENMV^WAKKFGVSIQLKK 414 

: I I I I I I I I I I I I I I I I I I . I I I I I I I I I I I I : I I I :: I I I I : 
Db 355 NDLLGHPKARLLVTHGGVNSIQEAIYHGVPMVAIPLFFDQFDNAVRIKAKHLGTFIPKDQ 414 

Qy 415 LKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPY 474 

I I II I I : : : : : I I : : M hi II I | : : | | : : | : : : I I I I I I 

Db 415 LKAEKLANAI RDVIGGESYKNSAMHLSLIQRSQPFPKDQQIVl^WVEHIVKVGGTDHLIPY 474 

Qy 475 VFQQPWHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

: I I I : : I I I I I I : I : : I : I Mil: I I I : I : 
Db 475 SYQQPLYQQYLLDVFLFVCVCVIGACYLTVKLLKMFIQKLCSFRKLKQ 522 



RESULT 9 
Q8NAW4 

ID Q8NAW4 PRELIMINARY; . PRT; 221 AA. 

AC Q8NAW4; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein FLJ34658. 

OS Homo sapiens (Human) . 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; .Homo. 

OX NCBIJTaxI D= 9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RX PubMed=14702039; DOI=10 . 1038/ngl285; 

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T . , Irie R. , 

RA Wakamatsu A., Hayashi K. , Sato H., Nagai K., Kimura K. , Makita H., 

RA Sekine M. , Obayashi M. , Nishi T., Shibahara T . , Tanaka T., Ishii S., 

RA Yamamoto J., Saito K. , Kawai Y., Isono Y., Nakamura Y. , Nagahari K. , 

RA Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M. , Shiratori A., 

RA Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H. , Sugawara M., 

RA Takahashi M. , Kanda K. , Yokoi T., Furuya T., Kikkawa E., Omura Y. , 

RA Abe K., Kamihara K., Katsuta N.,- Sato K., Tanikawa M. , Yamazaki M. , 

RA Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K. , 

RA Tanai H., Kimata M. , Watanabe M. , Hiraoka S., Chiba Y., Ishida S., 

RA Ono Y., Takiguchi S., Watanabe S., Yosida M. , Hotuta T., Kusano J., 

RA Kanehori K. , Takahashi-Fuj ii A. , Hara H., Tanase T . , Nomura Y w 

RA Togiya S., Komai F-, Hara R. , Takeuchi K. , Arita M. , Imose N., 

RA Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., 

RA Yoshikawa Y. , Matsunawa H., Ichihara T. f Shiohata N., Sano S., 

RA Moriya S. f Momiyama H. f Satoh N., Takami S., Terashima Y. r Suzuki O. , 

RA Nakagawa S., Senoh A., Mizoguchi H. f Goto Y. , Shimizu F. , Wakebe H., 

RA Hishigaki H. f Watanabe T w Sugiyama A., Takemoto M. f Kawakami B., 

RA Yamazaki M., Watanabe K. , Kumagai A., Itakura S., Fukuzumi Y., 

RA Fujimori Y., Komiyama M. , Tashiro H., Tanigami A. , Fujiwara T., 

RA Ono T. f Yamada K. , Fujii Y. , Ozaki K. , Hirao M. , Ohmori Y., 

RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y. > Okamoto S. f 

RA Okitani R. , Kawakami T. f Noguchi S., Itoh.T-, Shigeta K. , Senba T. , 

RA Matsumura K. , Nakajima Y . , Mizuno T., Morinaga M., Sasaki M- , 

RA Togashi T., Oyama M. , Hata H., Watanabe M. , Komatsu T., 

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y. , Nakagawa K., 

RA Okumura K., Nagase T., Nomura N., Kikuchi H. , Masuho Y., Yamashita R. , 

RA Nakai K., Yada T w Nakamura Y., Ohara O., Isogai T., Sugano S.; 

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs . " ; 

RL Nat. Genet. 36:40-45(2004). 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

DR EMBL; AK091977; BAC03783.1; -. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P:metabolism; IEA. 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Transferase. 

SQ SEQUENCE 221 AA; 24764 MW; ACB59BEF47ACD458 CRC64; 

Query Match 34.8%; Score 963;, DB 2; Length 221; 

Best Local Similarity 79.6%; Pred. No. 8e-68; 

Matches 176; Conservative 21; Mismatches 24; Indels 0; Gaps 0 

Qy 303 MWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVlilAANVKIVTWLPQSDLLAHPS 362 

I : I I I : I : I : I : I I I I I I I I I I I I II I I I I : I I I I I I I I I I I I I I I I I I I I I II 
Db 1 MLNTHQSQEVLKKMHNAFAHLPQGVIWTCQSSHWPRDVHLATNVKIVDWLPQSDLLAHPS 60 



Qy 363 IRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLAL 422 



I I I I I I I I I I I I : I I I I : I I I I I I I : I : III Mill II : I I I I : I : : I : I I I 
Db 61 I RLFVTHGGQNSVMEAI RHGVPMVGLPWGDQHG^^V[V^*WAKNYGVS I RLNQVTADTLTL 120 



Qy 423 KMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHE 482 

I I I : : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I II I I I I I I I I I I I I I I I I 
Db 121 TMKQVIEDKRYKSAWAASVILHSQPLSPAQRLVGWIDHILQTGGATHLKPYAFQQPWHE 180 

Qy 4 83 QYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

III I I I I I I I I I I I I I : I I I I I I I I : I I I I II I I I : I 
Db 181 QYLIDVFVFLLGLTLGTMWLCGKLLGWARWLRGARKVKKT 221 

RESULT 10 
Q8IYS9 

ID Q8IYS9 PRELIMINARY; PRT; 252 AA. 

AC Q8IYS9; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Testis; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Scares M.B., Bonaldo M.F., Casavant T.L., Scheetz T . E . , 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P. J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15 f 000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Testis; 

RA Strausberg R. ; 

RL Submitted (JUL-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC035012; AAH35012.1; -. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_glucos_trans . 



DR Pfam; PF00201; UDPGT; 1. 
KW Hypothetical protein.. 

SQ SEQUENCE 252 AA; 29155 MW; 956CC9F9718B8982 CRC64; 

Query Match 33.0%; Score 914.5; DB 2; Length 252; 

Best Local Similarity 75.7%; Pred. No. 6.5e-64; 

Matches 171; Conservative 21; Mismatches 33; Indels 1; Gaps 1; 

Qy 58 HKRGPFM-PDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEY 116 

I : I I : I I I : I I I I I I I I I : I I I I I : I I I I : : I I I I : I I : : : I 

Db 3 HQSGKFLI PDI KEEEKS YQVI RWFS PEDHQKRI KKHFDS YI ETALDGRKESEALVKLMEI 62 

Qy 117 LALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFG 176 

III: I : I I I I I I I I I I I I : I : I II I I : I I I I I II I I I I I I I I I : I I I I : I I 
Db 63 FGTQCSYLLSRKDIMDSLKNENYDLVFVEAFDFCSFLIAEKLVKPFVAILPTTFGSLDFG 122 

Qy 177 LPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPV 236 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 123 LPSPLSWPVFPSLLTDHMDFWGRVKNFLMFFSFSRSQWDMQSTFDNTIKEHFPEGSRPV 182 

Qy 237 LSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I : 
Db 183 LSHLLLKAELWFVNSDFAFDFARPLLPNTVYIGGLMEKPIKPVPQN 228 



RESULT 11 
Q8BRY7 

ID Q8BRY7 PRELIMINARY; PRT; 302 AA. 

AC Q8BRY7; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Mus musculus adult male aorta and vein cDNA, RIKEN full-length 

DE enriched library, clone :A530069C13 product : hypothetical UDP- 

DE glucoronosyl and UDP-glucosyl transferase containing protein, full 

DE insert sequence. 

GN Name=AI313915; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Aorta and vein; 

RX MEDLINE=99279253; PubMed=10349636; DOI=10 . 1016/S0076-6879 ( 99) 03004-9; 

,-RA Carninci P., Hayashizaki Y. ; 

RT "High-efficiency full-length cDNA cloning."; 

RL Meth. Enzymol. 303:19-44(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Aorta and vein; 

RX MEDLINE=21085660; PubMed=11217851 ; DOI=10 . 1038/35055500 ; 

RA RIKEN FANTOM Consortium; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

RN [3] 

RP SEQUENCE FROM N.A. 



RC STRAIN=C57BL/6J; TISSUE=Aorta and vein; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Aorta and vein; 

RX MEDLINE=20499374; PubMed=11042159 ; DOI=10 . 1101/gr . 145100; 

RA Carninci P., Shibata Y., Hayatsu N., Sugahara Y. , Shibata K. , Itoh M. , 

RA Konno H., Okazaki Y., Muramatsu M. , Hayashizaki Y. ; 

RT "Normalization and subtraction of cap-trapper-selected cDNAs to 

RT prepare full-length cDNA libraries for rapid discovery of new genes."; 

RL Genome Res. 10:1617-1630(2000). 

RN [5] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Aorta and vein; 

RX MEDLINE=2 05309 13; PubMed=11076861 ; DOI=10 . 1101/gr . 152600 ; 

RA Shibata K. , Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P., 

RA Konno H., Akiyama J., Nishi K. , Kitsunai T . , Tashiro H., Itoh M. , 

RA Sumi N., Ishii Y., Nakamura S., Hazama M. , Nishine T., Harada A., 

RA Yamamoto R. , Matsumoto H. , Sakaguchi S., Ikegami T., Kashiwagi K. , 

RA Fujiwake S., Inoue K. , Togawa Y. , Izawa M. , Ohara E., Watahiki M. , 

RA Yoneda Y. , Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., 

RA Okazaki Y., Muramatsu M. , Inoue Y., Kira A., Hayashizaki Y. ; 

RT "RIKEN integrated sequence analysis (RISA) system-384~f ormat 

RT sequencing pipeline with 384 multicapillary sequencer."; 

RL Genome Res. 10:1757-1771(2000). 

RN [6] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Aorta and vein; 

RA Adachi J., Aizawa K. , Akimura T., Arakawa T., Bono H., Carninci P., 

RA Fukuda S., Furuno M. , Hanagaki T., Hara A., Hashizume W., 

RA Hayashida K. , Hayatsu N., Hiramoto K. , Hiraoka T., Hirozane T., 

RA Hori F. , Imotani K., Ishii Y., Itoh M. , Kagawa I., Kasukawa T., 

RA Katoh H., Kawai J., Kojima Y., Kondo S., Konno H., Kouda M. , Koya S., 

RA Kurihara C, Matsuyama T. f Miyazaki A. , Murata M. , Nakamura M. , 

RA Nishi K., Nomura K. , Numazaki R. , Ohno M. , Ohsato N., Okazaki Y., 

RA Saito R., Saitoh H . , Sakai C, Sakai K. , Sakazume N . , Sano H., 

RA Sasaki D., Shibata K., Shinagawa A., Shiraki T., Sogabe Y., Tagami M. , 

RA Tagawa A. , Takahashi F., Takaku-Akahira S., Takeda Y., Tanaka T., 

RA Tomaru A. , Toya T., Yasunishi A. , Muramatsu M., Hayashizaki Y.; 

RL Submitted (JUL-2001) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AK041045; BAC30796.1; -. 

,DR MGD; MGI: 2145969; AI313915. 

DR GO; GO: 0016740; F: transferase activity; IEA. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

KW Hypothetical protein; Transferase. 

SQ SEQUENCE 302 AA; 35256 MW; C78A84C1D58987DC CRC64; 



Query Match 28.6%; Score 792; DB 2; Length 302; 

Best Local Similarity 57.9%; Pred. No. 3.9e-54; 

Matches 157; Conservative 37; Mismatches 75; Indels 2; Gaps 



Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

II I II: J I I : I I I I I I I I I I I : I I I : : : I I I I : I : I I I I I I : : 
Db 1 MAAHRRWLLMSFLFLEVILLEAAKILTISTLSASHYIVISRVSQVLHEGGHNVTKLLYES 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

: I I I : I I : I I I I I : I I I I : : : I II III I I : : I 

Db 61 T^J-IPDFRKEKPSYQVINWRPPEDQEKKFADLRHRLTEEITYGRSKHHTLLKIHQYFGDL 119 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLP-I 179 

II I : I I I I I I I I I I I I I : I : : : : I I I I I II I I I I : I I : : I I I I 
Db 120 CSQLLSRKDIMDFLKNENFDLVLLDSMDLCSLLIVEKLGKRFVSFLPFQFSYMDFGLPSA 179 

Qy 180 PLSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSH 239 

I I I I I I : I I I I I I II I I I I I I I I I I : I : : I : I : I I : I I I I I I : I I I I 
Db 180 PLSYAPWGSGLTDQMDFWGRVKNFLMFLDFSMKQREILSQYDSTIQEHFVEGSQPVLSD 239 

Qy 240 LLLKAELWFINSDFAFDFARPLLPNTVYVGG 270 

I I I I I I I I I : I I I I I I I II I I MINIM 
Db 240 LLLKAELWFVNSDFALDFARPLFPNTVYVGG 270 



RESULT 12 
UDC1_RABIT 

ID UDC1_RABIT STANDARD; PRT; 502 AA. 

AC P36514; 

DT 01-JUN-1994 (Rel. 29, Created) 

DT 01-JUN-1994 (Rel. 29, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE UDP-glucuronosyltransf erase 2C1 microsomal (EC 2.4.1.17) (UDPGT) 

DE (Fragment) . 

GN Name=UGT2Cl; Synonyms =UGT2A2 ; 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=New Zealand white; TISSUE=Liver ; 

RX MEDLINE=933 15511; PubMed=8325897 ; 

RA Tukey R^H., Pendurthi U.R., Nguyen N.T., Green M.D., Tephly T.R.; 

RT "Cloning and characterization of rabbit liver UDP- 

RT glucuronosyltransf erase cDNAs . Developmental and inducible expression 

RT of 4-hydroxybiphenyl UGT2B13 . " ; 

RL J. Biol. Chem. 268:15260-15266(1993). 

CC -!- FUNCTION: UDPGT is of major importance in the conjugation and 

CC subsequent elimination of potentially toxic xenobiotics and 

CC endogenous compounds . 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside . 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 



NON TER 


1 


1 






TRANS MEM 


466 


481 


Potential . 




CARBOHYD 


111 


177 


N-linked (GlcNAc. , 


. .) (Potential) 


CARBOHYD 


288 


288 


N-linked (GlcNAc. , 


. .) (Potential) 


SEQUENCE 


502 AA; 


57449 


MW; B6E65670BFAE1D35 


CRC64; 



CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license. agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L01083; AAA18023.1; 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycoprotein; Glycosyl trans f erase; Microsome; Multigene family; 
KW Transferase; Transmembrane. 
FT 
FT 
FT 
FT 
SQ 

Query Match 27.0%; Score 748.5; DB 1; Length 502; 

Best Local Similarity 35.4%; Pred. No. 2.1e-50; 

Matches 180; Conservative 82; Mismatches 178; Indels 69; Gaps 14; 

Qy 34 SHYLLMDRVSQI LQDHGHNVT MLNHKRGPFMPDFKK EE 71 

I I : : : : : I I I I : I : I : I : I I : : II 

Db 7 SHWINLKVILEELQLRGHEITVLVPSPSLLLDHTKIPFNVEVLQLQVTKETLMEELNTVL 66 

Qy 72 — KSYQV — ISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQCSHFLNR 127 

I : : : : I I III : I I I I : 

Db 67 YMSSFELPTLSWWKVLGKMVEMGKQFS KNLRRV CDSAITN 106 

Qy 128 KDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLE FGLPIPLSY 183 

I : : : I I I I I : : : : I hill III I I : : I I I I I I 

Db 107 KELLDRLKAAKFDICLADPLAFCGELVAELLNIPFVYSFRFSIGNIIERSCAGLPTPSSY 166 

Qy 184 VPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQH-MQSTFDNTIKEHFTE — GSRPVLSHL 240 

II I I I I : I I I : I I : I : : I I I : I I : : : : II : : 

Db 167 VPGSTSGLTDNMSFVQRLKNWLLYLMNDMMFSHFMLSEWD EYYSKVLGRRTTICEI 222 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVWGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

: I I I : I I I : I : I I I I I I I I I I I I I I I : I : : I I I : I : I I : I I 
Db 223 MGKAEMWLIRSYWDFEFPRPFLPNFEYVGGLHCKPAKPLPEELEEFVQSSGNDGVWFTL 282 

Qy 301 GSMVNTCQNPEI FKEMNN AFAHLPQGVIWKCQCSHWPKDVllIJ^VKIVDWLPQSD 356 

III: II : : I : I I I : I I I : I : I I I : : : I : I I : I 

Db 283 GSMI QN — LTEERSNLI ASALAQI PQKVLWRYT GKKPATLGPNTRLFEWI PQND 334 

Qy 357 LLAHPSIRLFWHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVT^V^AKKFGVSIQLKKLK 416 

I I I I 1-4:1 III I : III I I I I I I I I I I I I I I I : I : I I : I I I : I : : 

Db 335 LLGHPKTRAFITHGGTNGLYEAIYHGVPMVGIPLFGDQPDNIARVKAKGAAVDVDLRIMT 394 

Qy 417 AETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVF 476 

: I : | : : : | | | : I I I I I I I I I : I : : I I I I : 

Db 395 TSSLLKALKDVINNPSYKENAMKLSRIHHDQPLKPLDRAVFWIEFVMRHKGARHLRVAAH 454 

Qy 477 QQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

I : I I I I I I I : : I I 

Db 455 DLTWFQYYS LDVWFLLTCVAT 1 1 FLAKK 483 



RESULT 
Q98TB5 



13 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OC 
OX 
RN 
RP 
RA 
RT 
RT 
RL 
RN 
RP 
RA 
RL 
CC 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
SQ 



Q98TB5 PRELIMINARY; PRT; 541 AA. 

Q98TB5; 

01-JUN-2001 (TrEMBLrel. 17, Created) 
01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 
01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
UDP-galactose ceramide galactosyltransf erase (EC 2.4.1.47). 
Name=CGT ; 

Gallus gallus (Chicken) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Archosauria; Aves; Neognathae; Gallif ormes ; Phasianidae; Phasianinae; 
Gallus. 

NCBI_TaxID=9031; 
[1] 

SEQUENCE FROM N.A. 
Kapitonov D. ; 

"Molecular cloning and expression of ceramide galactosyltransf erases . 
Comparison with other glycosyltransf erases ." ; 

Thesis ( 1997 ), Medical College of Virginia, Richmond, VA, USA. 
[2] 

SEQUENCE FROM N.A. 
Kapitonov D. ; 

Submitted (FEB-1999) to the EMBL/ GenBank/DDBJ databases. 
~!~ SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 
AAK16234.1; -. 

F:N-acylsphingosine galactosyltransf erase act. . .; IEA. 
F: transferase activity, transferring hexosyl . . .; IEA. 
P: metabolism; IEA. 
InterPro; IPR002213; UDP_glucos_trans . 
Pfam; PF00201; UDPGT; 1. 
PROSITE; PS00375; UDPGT; 1. 
Glycosyltransf erase; Transferase. 

SEQUENCE 541 AA; 61598 MW; DFF1CA4C69E7 8 ICC CRC64; 



EMBL; AF129809; 
GO; GO: 0047263; 
GO; GO: 0016758; 
GO; GO: 0008152; 



Query Match 26.3%; Score 727; DB 2; Length 541; 

Best Local Similarity 33.3%; Pred. No. 1.2e-48; 

Matches 166; Conservative 99; Mismatches 199; Indels 34; 



Gaps 



10; 



Qy 

Db 

Qy 

Db 

Qy 
Db 



13 LLPGVLLSEAAKILTISTV-GGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEE 71 

I | : : | | | | : : : I I : : : I I I I I : I 

11 LWSAVGIARAAKIVWPPIMFESHLYIFKTLASALHDQGHQTVFLLSEGREIPPSNHYRL 70 

72 KS YQVI SWLAPEDHQREFKKS- FDFFLEETL GGRGKFENLLNVLEYLALQCSHFLN 126 

III I I I I I : : II | : : | : : : | : 

71 KRYPGI FNS ST.SDDFLQSKMRS I FSGRLTALELFDI LDH YSKNCDMI VG 119' 

127 RKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIPLSYVPV 186 

: : : I : I I I I I : : : I : : I I : I I II : : : I I I I I I I I I 

120 NQNLMHALKQEKFDLLLVDPNEMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPAPLSYVPE 179 



Qy 

Db 

Qy 



187 FRSLLTDHMDFWGRVKNFLMF FSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I : : I : I I : : II : : : : : : I I : I 

180 FNSLLTDRMNLFERMKNTFVYVISRFGVSFL VLPKYERIMQKHKVLPERSMYD-L 233 

241 LLKAELWFINSDFAFDFARPLLPNTWVGGLMEKPIKPVPQDLENFIAKFGDSGBVLVTL 300 
: : I I : : I I : I I I I I I I I I II : : I I I : I : I I : : : : : I I I I I : 



Db 



234 VHGSSLWMLCTDIALEFPRPTLP^^\AA!VGGILTKPASPLPEDLQTWWGANENGE T VLVSF 293 



Qy 


301 


Db 


294 


Qy 


361 


Db 


350 


Qy 


421 


Db 


410 


Qy 


481 


Db 


470 



GSMVNTCQNPEIFKEMNNAFAHLP 360 
I : I : : : : : I I I I I I I I : : I : I I I : : : I I I I : I I I I 
GAGVKYL-SEDVANKLARALARLPQRVIWRFSGN KPRNLGNNTKLIEWLPQNDLLGH 349 

PSI RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAETL 420 
I : I : I :: I I I I I I I : I I I I : I I I I I I I I : I I I : I I I : : I : I 
PNIKAFLSHGGLNSIFETMYHGVPWGIPLFGDHYDTMTRVQAKGMGILLNWKTVTESEL 409 

ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

: : : : : I I : I I I : I I I I I I : : : I : Mil: I : 
YEALEKVINDPSYRQRAQRLSEIHKDQPGHPVNRTVYWINYILRHNGAQHLRAAVYSISL 469 

HEQYLFDV- FVFLLGLTL 497 
: : : I I : I I I : I I 
YQYFLLDIAFWLVGAAL 487 



RESULT 14 
Q6UWM9 

ID Q6UWM9 PRELIMINARY; PRT; 527 AA. 

AC Q6UWM9; 

DT 05-JUL-2004 (TrEMBLrel. 27, Created) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last sequence update) 

DT 05-JUL-2004 (TrEMBLrel. 27, Last annotation update) 

DE RSDK2559 (UDP-glucuronosyltransf erase) . 

GN Name=UGT2A3; ORFNames=UNQ2559 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22887296; PubMed=12975309 ; DOI=10 . 1101/gr . 1293003 ; 

RA Clark H.F., Gurney A.L., Abaya E., Baker K. , Baldwin D. , Brush J., 

RA Chen J., Chow B., Chui C, Crowley C, Currell B. , Deuel B., Dowd P.,- 

RA Eaton D. , Foster J., Grimaldi C, Gu Q. , Hass P.E., Heldens S., 

RA Huang A., Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J., 

RA Lewis L., Liao D., Mark M., Robbie E., Sanchez C, Schoenfeld J., 

RA Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A., 

RA Vandlen R. , Watanabe C, Wieand D., Woods K., Xie M.H., Yansura D., 

RA Yi S . , Yu G . , Yuan J., Zhang M. Zhang Z., Goddard A. , Wood W.I., 

RA Godowski P.; 

RT "The secreted protein discovery initiative (SPDI) , a large-scale 

RT effort to identify novel human secreted and transmembrane proteins: a 

RT bioinf ormatics assessment."; 

RL Genome Res. 13:2265-2270(2003). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RA Court M.H. ; 

RL Submitted (FEB-2004) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

DR EMBL; AY358727; AAQ89089.1; -. 

DR EMBL; AY542891; AAS48425.1; -. 

DR GO; GO:0016758; F: transferase activity, transferring hexosyl . . .; IEA. 



DR GO; GO: 0008152; P : metabolism; IEA. 

DR InterPro; IPR006121; HeavyMe_transpt . 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS01047; HMA_1; UNKNOWN_l . 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Transferase . 

SQ SEQUENCE 527 AA; 60284 MW; EB6C8F886B4DEC5E CRC64; 



Query Match 25.8%; Score 714.5; DB 2; Length 527; 

Best Local Similarity 33.9%; Pred. No. l.le-47; 

Matches 172; Conservative 93; Mismatches 208; Indels 35; Gaps 13; 



Qy 


34 


SHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEEK-SYQVISWLAPEDHQREFKKS 

1 1 : 1 : : : 1 1 1 1 1 : 1 1 : 1 : 1 : : 1 : : 1 : hi 1 = 
SHWLNVKVI LEELIVRGHEVTVLTHSK-PSLI DYRKPSALKFEWH — MPQDRTEENEIF 


92 


Db 


34 


90 


Qy 


93 


FDFFLEETLGGRGKFENLLNVLEY LALQCSHFLNRKDIMDSLKNENFDMVIVE 


145 


Db 


91 


| | || :::::::: | : | I : : 1 1 : | : | : : : : : 
VIJLAL-NVIjPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKLQETNYD 


149 


Qy 


146 


TFDYCPFLIAEKLGKPFVAILSTSF-GSLEFG LPIPLSYVPVFRSLLTDHMDFWGRV 

1 1 : 1 1 1 IN 1 I 1 : : 1 II 1 1 II 1 1 1 : 1 1 1 1 1 II 
PVIPCGDLMAELLAVPFVXTLRISVGGNMERSCGKLPAPLSYV^ 


201 


Db 


150 


209 


Qy 


202 


KNFLMFFSFCRRQQHMQSTFDNTIKEHFTE GSRPVLSHLLLKAELWFINSDFAFDFA 

||:: I : 1 II 1 1 : I I I : I I : : 1 : 1 
KNSMLSVLF HFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRTYWDFEFP 


258 


Db 


210 


265 




9 S Q 

c. D .7 


dot t PNTVYVf^ftT.MFKPTK'PVPOnT.FNFTAKFGDSGFVLVT 

: 1 II : 1 1 1 1 | | | : I : : : I I I : 1 : 1 1 : : 1 1 1 : II : : 1 1 

QPYQPNFEFVGGLHCKPAKALPKEMENFVQSSGEDGI VVFSLGSLF QN — VTEEKAN 


318 


Db 


266 


320 


Qy 


319 


AFAHLPQGVIWKCQCSHWPKDVlilW\NVKIvT)WLPQSDLiy^ 

I I : 1 1 1 : 1 : : 1 1 1 1 : : 1 1 : 1 1 : 1 1 1 1 1 : 1 : 1 1 1 1 1 
IIASALAQIPQKVLWRYK GKKPSTLGANTRLYDWIPQNDLLGHPKTKAFITHGGMNG 


374 


Db 


321 


377 


Qy 

Db 


375 
378 


IMEAIQHGVPMVGIPLFGDQPENMWVEAK^ 

1 1 1 1 1 1 1 1 1 1 1 : 1 : 1 1 1 1 : 1 : : : 1 1 II l-ll : : : : I II 
IYEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYK 


434 
437 


Qy 


435 


SAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLG 

1 : II 1 : 1 | | | 1 : 1 :: MM: 1 : 1 II III 
ENAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYSIDVIGFLLT 


494 


Db 


438 


497 


...Qy 


495 


LTLGTLWLCGKLLGMAWWLRGARKVKE 522 




Db 


498 


: : 1 1 : 1 1 : : : 
CVATAI FLFTKCFLFSCQKFNKTRKI EK 525. 





RESULT 15 
CGT HUMAN 



ID CGT_HUMAN STANDARD; PRT; 541 AA. 

AC Q16880; 000196; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 



DE 2-hydroxyacylsphingosine l-beta~galactosyltransf erase precursor 

DE (EC 2.4.1.45) (UDP-galactose-ceramide galactosyltransf erase ) (Ceramide 

DE UDP-galactosyltransf erase) (Cerebroside synthase) . 

GN Name=UGT8; Synonyms =CGT, UGT4; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96299661; PubMed=8661025 ; DOI=10 . 1006/geno . 1996 . 0242 ; 

RA Bosio A. , Binczek E. , Lebeau M.M., Fernald A. A., Stoffel W. ; 

RT "The human gene CGT encoding the UDP-galactose ceramide galactosyl 

RT transferase (cerebroside synthase) : cloning, characterization, and 

RT assignment to human chromosome 4, band q26."; 

RL Genomics 34:69-75(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97242209; PubMed=9125199 ; DOI=10 . 1006/bbrc . 1997 . 6240 ; 

RA Kapitonov D.E., Yu R.K.; 

RT "Cloning, characterization, and expression of human ceramide 

RT galactosyltransferase cDNA. " ; 

RL Biochem. Biophys. Res. Commun. 232:449-453(1997). 

CC -!- FUNCTION: Catalyzes the transfer of galactose to ceramide, a key 
CC enzymatic step in the biosynthesis of galactocerebrosides, which 

CC are abundant sphingolipids of the myelin membrane of the central 

CC nervous system and peripheral nervous system. 

CC -!- CATALYTIC ACTIVITY: UDP-galactose .+ 2- (2-hydroxyacyl) sphingosine = 
CC UDP + 1- (beta-D-galactosyl) -2- (2-hydroxyacyl) sphingosine. 

CC -!- PATHWAY: Galactocerebroside biosynthesis. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U30930; AAC50565.1; -. 

DR EMBL; U32370; AAC50815.1; -. 

DR EMBL; U31353; AAC50815.1; JOINED. 

DR EMBL; U31461; AAC50815.1; JOINED. 

DR EMBL; U31658; AAC50815.1; JOINED. 

DR EMBL; U31861; AAC50815.1; JOINED. 

DR EMBL; U62899; AAC51187.1; 

DR Genew; HGNC: 12555; UGT8 . 

DR MIM; 601291; -. 

DR GO; GO: 0008489; F: UDP-galactose-glucosylceramide beta-l, 4-gal . . .; TAS . 

DR GO; GO: 0007417; P: central nervous system development; TAS. 

DR GO; GO:0007422; P:peripheral nervous system development; TAS. 

DR InterPro; IPR002213; UDP_glucos_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycoprotein; Glycosyltransf erase; Microsome; Signal; Transferase; 

KW Transmembrane. 



FT 


SIGNAL 


1 


20 


Potential . 




FT 


CHAIN 


21 


541 


2-hydroxyacylsphingosine 1-beta- 


FT 








galactosyl transferase. 


FT 


TRANS MEM 


472 


492 


Potential . 




FT 


CARBOHYD 


78 


78 


N-linked (GlcNAc. 


. . ) (Potential) . 


FT 


CARBOHYD 


333 


333 


N-linked (GlcNAc. 


. . ) (Potential) . 


FT 


CARBOHYD 


442 


442 


N-linked (GlcNAc. 


. . ) (Potential) . 


FT 


CONFLICT 


99 


99 


T -> P (in Ref. 2) 




FT 


CONFLICT 


116 


116 


L -> M (in Ref. 2) 




FT 


CONFLICT 


356 


356 


L -> V (in Ref. 2) 




FT 


CONFLICT 


379 


379 


L -> V (in Ref. 2) 




SQ 


SEQUENCE 


541 AA; 


61455 MW 


; EC532798F7E15834 


CRC64; 


Query Match 




25.3%; 


Score 699; DB 1; 


Length 541; 



Best Local Similarity 32.2%; Pred. No. 1.9e-46; 

Matches 166; Conservative 95; Mismatches 185; Indels 70; Gaps 12; 

Qy 13 LLPGVTiLSEAAKILTISTV-GGSHYLLMDRVSQILQDHGHN-VTMLNHKRGPFMPDFKKE 70 

I I : : : I I I I : : : II : : : I. : I I : I : I :" I 
Db 11 LWSAVGIAKAAKI I IVPPIMFESHMYI FKTLASALHERGHHTVFLLSEGRD 61 

Qy 71 EKS YQVI SWLAPEDHQ — REFKKSF DFFLEETL GGRGKFENLLNVLEYLAL 119 

: I I : I : : I III:: II I : : I : : 

Db 62 IAPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLTAIELFDILDHYTK 112 

Qy 120 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

I : : : I I I I I : : : I : I . I I : I I II : : : III 
Db 113 NCDLMVGNHALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPA 172 

Qy m 180 P LS YVP VFRS LLT DHMD FWGRVKN FLMFFS FCRRQQHM QSTFD 222 

I I : I I I I I I I I I I : I : I I I I : : I I : I : I 

Db 173 PLAYVPEFNSLLTDRMNLLQRMKNTGVYLISRLGVSFLVLPKYERIMQKYNLLPEKSMYD 232 

Qy 223 NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I : : I I : : I I : I I I I I I I I I I I :: I I I : I : I 
Db 233 LVHGSSLWMLCTDVALEFPRPTLPNWYVGGILTKPASPLPED 275 

Qy 283 

Db 276 

Qy 343 

Db 332 

Qy 403 AKKFGVS IQLKKLKAETLALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHV 462. 

II I: :: I : : I : ::: : |: | I I : | | | : |||:: 

Db 392 AKGMGILLEWKTVTEKELYEALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTIYWIDYI 451 

Qy 4 63 LQTGGATHLKPYVFQQPWHEQYLFDV- FVFLLGLTL 4 97 

: : II II: II : : : I I : I I I I I I 
Db 452 I RHNGAHHLRAAVHQI S FCQYFLLDI AFVLLLGAAL 4 87 



LENFIAKFGDSGFVIjWLGSMWTCQNPEIFKEM^NAFAHLPQGVIWKCQCSHWPKDVHL 342 
I : : : .: I I I I I : I : I : : I : : I III III: II : I 

LQRWWGANEHGFVXVSFGAGVKYL-SEDIANKLAGALGRLPQKVIWRFS GPKPKNL 331 

< 

AANVKIVTDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMWV^ 402 

I I : : : I II I : I I I I I : I : : I I I III I : I I I I : I I I I I I I I : I I I : 
GNNTKLIEWLPQNDLLGHSKIKAFLSHGGLNSIFETMYHGVPWGIPLFGDHYDTMTRVQ 391 



Search completed: February 15, 2005, 12:57:51 
Job time : 150 sees 



